Abstract

We consider the min-cost multicast problem (under network coding) with multiple correlated sources 
where each terminal wants to losslessly reconstruct all the sources. This can be considered as the net- 
work generalization of the classical distributed source coding (Slepian-Wolf) problem. We study the 
inefficiency brought forth by the selfish behavior of the terminals in this scenario by modeling it as a 
noncooperative game among the terminals. The solution concept that we adopt for this game is the pop- 
ular local Nash equilibrium (Wardrop equilibrium) adapted for the scenario with multiple sources. The 
degradation in performance due to the lack of regulation is measured by the Price of Anarchy (POA), 
which is defined as the ratio between the cost of the worst possible Wardrop equilibrium and the socially 
optimum cost. Our main result is that in contrast with the case of independent sources, the presence of 
source correlations can significantly increase the price of anarchy. Towards establishing this result we 
make several contributions. We characterize the socially optimal flow and rate allocation in terms of 
four intuitive conditions. This result is a key technical contribution of this paper and is of independent 
interest as well. Next, we show that the Wardrop equilibrium is a socially optimal solution for a different 
set of (related) cost functions. Using this, we construct explicit examples that demonstrate that the POA 
> 1 and determine near-tight upper bounds on the POA as well. The main techniques in our analysis 
are Lagrangian duality theory and the use of the supermodularity of conditional entropy. Finally, all
the techniques and results in this paper extend naturally to a large class of network information flow
problems where the Slepian-Wolf polytope is replaced by any contra-polymatroid (or more generally
polymatroid-like set), leading to a nice class of succinct multi-player games and allowing the investigation
of other practical and meaningful scenarios beyond network coding as well.

1 Introduction 

In large scale networks such as the Internet, the agents involved in producing and transmitting information 
often exhibit selfish behavior, e.g. if a packet needs to traverse the networks of various ISPs, each ISP will
behave in a greedy manner and ensure that the packet spends the minimum time on its network. While this
minimizes the ISP's cost, it may not be the best strategy from an overall network cost perspective. Selfish
routing, which deals with the question of network performance under a lack of regulation, has been studied
extensively (see [20, 25]) and has developed as an area of intense research activity. However, by and large
most of these studies have considered the network traffic injected into the network at various sources to be 
independent. 
From an information theoretic perspective there is no need to consider the sources involved in the transmission
to be independent. In this work we initiate the study of network optimization issues related to the
transmission of correlated sources over a network when the agents involved are selfish. In particular, we
concentrate on the problem of multicasting correlated sources over a network to different terminals, where
each terminal is interested in losslessly reconstructing all the sources. We assume that the network is capable
of network coding. Under this scenario, a generalization of the classical Slepian-Wolf theorem of distributed
source coding [14] holds for arbitrary networks. In particular, when the network performs random linear
network coding each terminal can recover the sources under appropriate conditions on the Slepian-Wolf region
and the capacity region of the terminals with respect to the sources, thereby allowing distributed source
coding over networks. The selfish agents in our set-up are the terminals who pay for the resources. Each
terminal aims to minimize her own cost while ensuring that she can satisfy her demands. It is important to
note that this is a generalization of the problem of minimum cost selfish multicast of independent sources
considered by Bhadra et al. [5].

1.1 Our Results 

In this work, we model the scenario as a noncooperative game amongst the selfish terminals who request 
rates from sources and flows over network paths such that their individual cost is minimized (i.e. with no 
regard for social welfare) while allowing for reconstruction of all the sources. We investigate properties 
of the socially optimal solution and define appropriate solution concepts (Nash equilibrium and Wardrop 
equilibrium) for this game and investigate properties of the flow-rates at equilibrium. We briefly describe 
our contributions below. 

i) Characterization of social-optimality conditions. The problem of computing the socially optimal cost
is a convex program. We present a precise characterization of the optimality conditions of this con- 
vex program in terms of four intuitive conditions, using Lagrangian duality theory and by judiciously 
exploiting the super-modularity of conditional entropy. This result is a key technical contribution of 
this paper and is of independent interest as well. 

ii) Demonstrating the equivalence of flow-rates at equilibrium with social-optimal solutions for alternative
instances. We consider certain meaningful market models that split resource costs amongst
the different terminals and show that the flows and rates under the game-theoretic equilibria are in
fact socially optimal solutions for a different set of cost functions. This characterization allows us to
quantify the degradation caused by the lack of regulation. The measure of performance degradation
due to such loss in regulation that we adopt is the Price of Anarchy (POA), which is defined as the
ratio between the cost of the worst possible equilibrium and the socially optimum cost [15, 22, 26, 25].

iii) Showing that source correlation induces anarchy. The main result of this work is that the presence 
of source correlations can significantly increase the POA under reasonable cost-splitting mechanisms. 
This is in stark contrast to the case of multicast with independent sources, where for a large class of 
cost functions, cost-splitting mechanisms can be designed that ensure that the price of anarchy is one. 
We construct explicit examples where the POA is greater than one and also obtain an upper bound on 
the POA which is near tight. 

Finally, we expect that the techniques developed in the present work will be applicable to a large class 
of network information flow problems with correlated sources where the Slepian-Wolf polytope is replaced 
by polymatroid-like objects. These include multi-terminal source coding with high resolution [28] and the
CEO problem Ell. 

1.2 Background and Related Work 

Distributed source coding (or distributed compression) (see [7, Ch. 14] for an overview) considers the problem
of compressing multiple discrete memoryless sources that are observing correlated random variables.
The landmark result of Slepian and Wolf [27] characterizes the feasible rate region for the recovery of the
sources. However, the problem of Slepian and Wolf considers a direct link between the sources and the 
terminal. More generally one would expect that the sources communicate with the terminal over a network. 
Different aspects of the Slepian- Wolf problem over networks have been considered in (ll2l!Hll24J). Network 
coding (first introduced in the seminal work of Ahlswede et al. [1]) for correlated sources was studied by Ho
et al. [14]. They considered a network with a set of sources and a set of terminals and showed that as long as
the minimum cuts between all non-empty subsets of sources and a particular terminal were sufficiently large 
(essentially as long as the Slepian-Wolf region of the sources has an intersection with the capacity region of 
a given terminal), random linear network coding over the network followed by appropriate decoding at the 
terminals achieves the Slepian-Wolf bounds. 

The problem of minimum cost multicast under network coding has been addressed in the work of [19,
18]. The multicast problem has also been examined by considering selfish agents [5, 16, 17]. Our work is
closest in spirit to the analysis of Bhadra et al. [5] that considers selfish terminals. In this scenario, for a
large class of edge cost functions, they develop a pricing mechanism for allocating the edge costs among the
different terminals and show that it leads to a globally optimal solution to the original optimization problem,
i.e. the price of anarchy is one. Their POA analysis is similar to that in the case of selfish routing [26, 25].
Our model is more general and our results do not generalize from theirs in a straightforward manner. In 
particular, we need to judiciously exploit several non-trivial properties of the Slepian-Wolf polytope in our 
analysis. 

Further, motivated by the need to deal with selfish users, particularly in network settings, there has been
a large body of recent work at the intersection of networking, game theory, economics, and theoretical 
computer science ll20l l4l[T3]|. This work adds another interesting dimension to this interdisciplinary area. 

2 The Model 

Consider a directed graph $G = (S \cup T \cup V, E)$. There is a set of source nodes $S$ that may be correlated and a
set of sinks $T$ that are the terminals (i.e. receivers). Each source node $i \in S$ observes a discrete memoryless source
$X_i$. The Slepian-Wolf region of the sources is assumed to be known and is denoted $\mathcal{R}_{SW}$. For notational
simplicity, let $N_S = |S|$, $N_T = |T|$, $S = \{1, 2, \ldots, N_S\}$, and $T = \{t_1, t_2, \ldots, t_{N_T}\}$. The set of paths from
source $s$ to terminal $t$ is denoted by $\mathcal{P}_{s,t}$. Further, define $\mathcal{P}_t = \bigcup_{s \in S} \mathcal{P}_{s,t}$, i.e. the set of all possible paths
going to terminal $t$, and $\mathcal{P} = \bigcup_{t \in T} \mathcal{P}_t$, the set of all possible paths. A flow is an assignment of non-negative
reals to each path $P \in \mathcal{P}$. The flow on $P$ is denoted $f_P$. A rate is a function $R : S \times T \to \mathbb{R}^+$, i.e. the
rate requested by the terminal $t$ from the source $s$ is $R_{s,t}$. We will refer to a flow and rate pair $(f, R)$ as a
flow-rate. Also, let us denote the rate vector for terminal $t$ by $R_t$ and the vector of requested rates at source
$s$ by $\rho_s$, i.e. $R_t = (R_{1,t}, R_{2,t}, \ldots, R_{N_S,t})$ and $\rho_s = (R_{s,t_1}, R_{s,t_2}, \ldots, R_{s,t_{N_T}})$.

Associated with each edge $e \in E$ is a cost $c_e$, which takes as argument a scalar variable $z_e$ that depends
on the flows to various terminals passing through $e$. Similarly, let $d_s$ be the cost function corresponding to the
source $s$, which takes as argument a scalar variable $y_s$ that depends on the rates that various terminals request
from $s$. These functions $c_e$'s and $d_s$'s are assumed to be convex, positive, differentiable and monotonically
increasing. Further, the functions $\int \frac{c_e(x)}{x}\,dx$ are also convex, positive, differentiable and monotonically
increasing. In particular, these conditions are satisfied by functions like $x^a$, $a > 1$ and $x e^{bx}$, $b > 0$ among
others.

The network connection we are interested in supporting is one where each terminal can reconstruct all 
the sources, i.e. we need to jointly allocate rates and flows for each terminal so that it can reconstruct the 
sources. We now present a formal description of the optimization problem under consideration. 



2.1 Min-Cost Multicast with Multiple Sources 

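Let us call the quadruple $(G, c, d, \mathcal{R}_{SW})$ an instance. The problem of minimizing the total cost for the
instance $(G, c, d, \mathcal{R}_{SW})$ can be formulated as

(NIF-CP)
$$\begin{aligned}
\text{minimize} \quad & C(f, R) = \sum_{e \in E} c_e(z_e) + \sum_{s \in S} d_s(y_s) \\
\text{subject to} \quad & f_P \ge 0 \quad \forall P \in \mathcal{P} \\
& \sum_{P \in \mathcal{P}_{s,t}} f_P \ge R_{s,t} \quad \forall s \in S, \forall t \in T \qquad (1) \\
& R_t \in \mathcal{R}_{SW} \quad \forall t \in T
\end{aligned}$$

where $z_e$, $\forall e \in E$, is a function of $x_{e,t_1}, x_{e,t_2}, \ldots, x_{e,t_{N_T}}$, that we denote $z_e(x_{e,t_1}, x_{e,t_2}, \ldots, x_{e,t_{N_T}})$, with
$x_{e,t} = \sum_{P \in \mathcal{P}_t : e \in P} f_P$, and $y_s$, $\forall s \in S$, is a function of $\rho_s$ that we will denote $y_s(\rho_s)$.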

The formulation above is similar to the one presented in [5]. However, since we consider source correlations
as well, their formulation is a specific case of our formulation. Since network coding allows the
sharing of edges, the penalty at an edge is only the maximum and not the sum, i.e. $z_e$ is the maximum flow
(among the different terminals) across the edge $e$. Similarly, the penalty at the sources for higher resolution
quantization is also driven by the maximum level requested by each terminal, i.e. $y_s$ is also a maximum. In
this work, for differentiability requirements the maximum function will be approximated as an $L_p$ norm with a
large $p$. Nevertheless, most of our analysis is done where $z_e$ and $y_s$ are non-decreasing functions partially
differentiable with respect to their arguments, such that $c_e(z_e)$ and $d_s(y_s)$ are convex, positive, differentiable
and monotonically increasing. Note that in the formulation above, the objective function is convex and all
constraints are linear, which implies that this is a convex optimization problem.
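
As a rough illustration of the $L_p$ approximation mentioned above, the following sketch computes the $L_p$ norm of the per-terminal flows on a single edge and shows it approaching the maximum as $p$ grows. The function name `lp_norm` and the sample flow values are assumptions made for this example only; they do not come from the paper.

```python
def lp_norm(values, p):
    """Return the L_p norm of a sequence of non-negative reals."""
    if p < 1:
        raise ValueError("p must be at least 1")
    peak = max(values)
    if peak == 0:
        return 0.0
    # Factor out the largest entry so that a large p does not overflow.
    return peak * sum((v / peak) ** p for v in values) ** (1.0 / p)


# Hypothetical per-terminal flows x_{e,t} on one edge e (illustrative values).
x_e = [1.0, 2.5, 4.0]

for p in (1, 2, 10, 100):
    # The printed value decreases towards max(x_e) as p increases.
    print(p, lp_norm(x_e, p))
```

For $p = 1$ the norm is the plain sum of the flows, which corresponds to routing without sharing; large $p$ recovers the shared-edge penalty while keeping $z_e$ differentiable.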

The constraint (1) above models the fact that the total flow from the source $s$ to a terminal $t$ needs to
be at least $R_{s,t}$. Finally, the rate point of each terminal $R_t$ needs to be within the Slepian-Wolf polytope.
A flow-rate $(f, R)$ satisfying all the conditions in the above optimization problem (i.e. (NIF-CP)) will be
called a feasible flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$ and the cost $C(f, R)$ will be referred to as the
social cost corresponding to this flow-rate. Also, we will call a solution $(f^*, R^*)$ of the above problem
an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$.

Consider a feasible flow-rate $(f, R)$ for the above optimization problem. It can be seen that the value of
the flow from $A \subseteq S$ to a terminal $t \in T$ is $\sum_{P \in \bigcup_{s \in A} \mathcal{P}_{s,t}} f_P \ge \sum_{s \in A} R_{s,t}$. Since $R_t \in \mathcal{R}_{SW}$, the result of
[14] shows that random linear network coding followed by appropriate decoding at the terminals can recover
the sources with high probability. Conversely the result of lITll |2]| shows the necessity of the existence of 
such a flow. 
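
The membership condition $R_t \in \mathcal{R}_{SW}$ can be checked directly from a joint distribution of the sources. The sketch below is only an illustration: the helper names (`entropy`, `marginal`, `cond_entropy`, `in_slepian_wolf`) and the two-source joint pmf are assumptions introduced here, and the check enumerates all non-empty subsets, so it is practical only for a handful of sources.

```python
from itertools import combinations
from math import log2


def entropy(pmf):
    """Shannon entropy (in bits) of a pmf given as {outcome: probability}."""
    return -sum(p * log2(p) for p in pmf.values() if p > 0)


def marginal(joint, keep):
    """Marginal pmf of the coordinates listed in `keep`."""
    out = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, 0.0) + p
    return out


def cond_entropy(joint, subset, n):
    """H(X_A | X_{A^c}) computed as H(X_1, ..., X_n) - H(X_{A^c})."""
    rest = [i for i in range(n) if i not in subset]
    return entropy(joint) - entropy(marginal(joint, rest))


def in_slepian_wolf(rates, joint, tol=1e-9):
    """Check sum_{i in A} R_i >= H(X_A | X_{A^c}) for every non-empty A."""
    n = len(rates)
    for k in range(1, n + 1):
        for subset in combinations(range(n), k):
            if sum(rates[i] for i in subset) < cond_entropy(joint, subset, n) - tol:
                return False
    return True


# Hypothetical pair of correlated binary sources (illustrative numbers).
joint = {(0, 0): 0.45, (0, 1): 0.05, (1, 0): 0.05, (1, 1): 0.45}

print(in_slepian_wolf((1.0, 0.5), joint))
print(in_slepian_wolf((0.7, 0.7), joint))
```

The second rate pair satisfies both individual constraints but violates the sum-rate constraint, which is the constraint that couples the rate requests of a terminal when sources are correlated.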



2.2 Terminals' Incentives and the Distributed Compression Game 

The above formulation for social cost minimization for the instance $(G, c, d, \mathcal{R}_{SW})$ disregards the fact that
the agents who pay for the costs incurred at the edges and the sources may not be cooperative and may have
incentives for strategic manipulation. In this work we consider the scenario where the terminals pay for
the network resources they are being provided. The terminals are noncooperative and will behave selfishly
trying to minimize their own respective costs without regard to the social cost, while ensuring that they can
reconstruct all the sources. We have the following assumptions.



(i) Let $(f, R)$ denote a feasible flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$. The network operates via
random linear network coding (or some practical linear network coding scheme) over the subgraph of
$G$ induced by the corresponding $\{z_e\}$ for $e \in E$. The terminals are capable of performing appropriate
decoding to recover the sources.

(ii) Each terminal $t \in T$ can request any specific set of flows on the paths $P \in \mathcal{P}_t$ and rates $R_t$ as
long as such a request allows reconstruction of the sources at $t$. There is a mechanism in the network
by means of which this request is accommodated, i.e. the subgraph over which random linear network
coding is performed is adjusted appropriately.

In this work we wish to characterize flow-rates that represent an equilibrium among selfish terminals 
who act strategically to minimize their own costs. Furthermore, we shall systematically study the loss that 
occurs due to the mismatch between the social goals and the terminals' selfish goals.

Towards this end, we now formally model the game originating from the selfish behavior of the terminals.
We model this game as a normal form game or strategic game [21], which we refer to as the
Distributed Compression Game (DCG).

A normal form game, denoted $(N, \{A_i\}_{i \in N}, \{\succeq_i\}_{i \in N})$, consists of the set of players $N$, the tuple of sets
of strategies $A_i$ for each player $i \in N$, and the tuple of preference relations $\succeq_i$ for each player $i \in N$ on the
set $A = \times_{i \in N} A_i$. For $a, b \in A$, $a \succeq_i b$ means that the player $i$ prefers the tuple of strategies $a$ to the tuple
of strategies $b$. In the context of the Distributed Compression Game, given an instance $(G, c, d, \mathcal{R}_{SW})$, these
parameters are defined as follows.

2.2.1 The Distributed Compression Game 

• Players: $N = T$, i.e. the terminals are the players. This is because, as mentioned above, the terminals
are the users and they are the ones who pay for the network resources they are being provided.

• Strategies: The strategy set $A_t$ of a player $t \in T$ consists of tuples $(f_t, R_t)$ where

- $f_t$ is the vector of flows on paths going to $t$, i.e. the vector of values $f_P$ for all $P \in \mathcal{P}_t$, and recall
that $R_t$ denotes the rate vector for terminal $t$;

- $f_P \ge 0$ $\forall P \in \mathcal{P}_t$, $\sum_{P \in \mathcal{P}_{s,t}} f_P \ge R_{s,t}$ $\forall s \in S$, and $R_t \in \mathcal{R}_{SW}$.

Note that a feasible flow-rate $(f, R)$ for the instance $(G, c, d, \mathcal{R}_{SW})$ is an element of the set $A =
\times_{t \in T} A_t$ defined for the same instance.

• Preference Relations: To specify the preference relation of terminal $t \in T$, we need to know how
much she pays given a feasible flow-rate $(f, R)$, i.e. what fractions of the costs at various edges
and sources are being paid by $t$. To this end, we need market models, i.e. mechanisms for splitting
the costs among various terminals.

Therefore,
- Edge Costs: At a flow $f$, the cost of an edge $e \in E$ is $c_e(z_e)$. It is split among the terminals
$t \in T$, each paying a fraction of this cost. Let us say that the fraction paid by the player $t$
is $\Psi_{e,t}(x_e)$, i.e. the player $t$ pays $c_e(z_e)\Psi_{e,t}(x_e)$ for the edge $e$, where $x_e$ denotes the vector
$(x_{e,t_1}, x_{e,t_2}, \ldots, x_{e,t_{N_T}})$. Of course, $\sum_{t \in T} \Psi_{e,t}(x_e) = 1$ to ensure that the total cost is borne
by someone or the other. The total cost borne by $t$ across all the edges is $\sum_{e \in E} c_e(z_e)\Psi_{e,t}(x_e)$,
denoted $C_E^{(t)}(f)$.

- Source Costs: At a rate $R$, the cost for the source $s$ is $d_s(y_s)$, which is split among the terminals
$t \in T$, such that $t$ pays a fraction $\Phi_{s,t}(\rho_s)$, i.e. player $t$ pays $d_s(y_s)\Phi_{s,t}(\rho_s)$ for the source
$s$. Of course, $\sum_{t \in T} \Phi_{s,t}(\rho_s) = 1$. Therefore, the total cost borne by $t$ for all sources, denoted
$C_S^{(t)}(R)$, is $\sum_{s \in S} d_s(y_s)\Phi_{s,t}(\rho_s)$.

Thus, with the edge-cost-splitting mechanism $\Psi$ and the source-cost-splitting mechanism $\Phi$, the total
cost incurred by the player $t \in T$ at flow-rate $(f, R)$, denoted $C^{(t)}(f, R)$, is

$$C^{(t)}(f, R) = C_E^{(t)}(f) + C_S^{(t)}(R) = \sum_{e \in E} c_e(z_e)\Psi_{e,t}(x_e) + \sum_{s \in S} d_s(y_s)\Phi_{s,t}(\rho_s).$$

Now, each terminal $t$ would like to minimize its own cost, i.e. the function $C^{(t)}(f, R)$, and therefore the
preference relations $\{\succeq_t\}$ are as follows. For two flow-rates $(f, R) \in A$ and $(\tilde{f}, \tilde{R}) \in A$, $(f, R) \succeq_t
(\tilde{f}, \tilde{R})$ if and only if $C^{(t)}(f, R) \le C^{(t)}(\tilde{f}, \tilde{R})$. Also, $(f, R) \succ_t (\tilde{f}, \tilde{R})$ iff $C^{(t)}(f, R) < C^{(t)}(\tilde{f}, \tilde{R})$.

Note that for specifying a Distributed Compression Game, in addition to the parameters $G$, $c$, $d$ and $\mathcal{R}_{SW}$
we also need the cost-splitting mechanisms $\Psi$ and $\Phi$. We will call $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ an instance of the
Distributed Compression Game.
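
To make the role of an edge-cost-splitting mechanism concrete, the sketch below evaluates each terminal's share of the edge costs under one possible rule. The proportional rule (each terminal pays in proportion to its flow on the edge), the quadratic edge cost, and all names and numbers are assumptions chosen for illustration; the text above only requires that the fractions on each edge sum to one.

```python
def proportional_shares(x_e):
    """Hypothetical split: terminal t pays x_{e,t} / sum_t' x_{e,t'} of the edge cost."""
    total = sum(x_e.values())
    if total == 0:
        # No flow on the edge: split evenly so that the shares still sum to one.
        return {t: 1.0 / len(x_e) for t in x_e}
    return {t: x / total for t, x in x_e.items()}


def terminal_edge_costs(edge_flows, edge_cost):
    """Return {terminal: sum_e c_e(z_e) * share_{e,t}} with z_e = max_t x_{e,t}."""
    costs = {}
    for x_e in edge_flows.values():
        z_e = max(x_e.values())  # network coding: only the maximum flow is penalized
        for t, share in proportional_shares(x_e).items():
            costs[t] = costs.get(t, 0.0) + edge_cost(z_e) * share
    return costs


# Illustrative instance: two edges, two terminals, c_e(z) = z^2 on every edge.
edge_flows = {
    "e1": {"t1": 2.0, "t2": 1.0},
    "e2": {"t1": 0.0, "t2": 3.0},
}
costs = terminal_edge_costs(edge_flows, lambda z: z ** 2)
print(costs)
```

Because the shares on every edge sum to one, the per-terminal costs add up to the total edge cost $\sum_{e} c_e(z_e)$, which is the property the preference relations rely on.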

2.2.2 Solution Concepts for the Distributed Compression Game 

We now outline the possible solution concepts in our scenario. These are essentially dictated by the level
of sophistication of the terminals. Sophistication refers to the amount of information and computational
resources available to a terminal. In this work we shall work with two different solution concepts that we 
now discuss. 

a) Nash Equilibrium. The solution concept of Nash equilibrium requires the complete information setting
and requires each terminal to compute her best response to any given tuple of strategies of the other players.
For notational simplicity, let $f_{-t}$ be the vector of flows on paths not going to terminal $t$, i.e. the vector of
values $f_P$ for all $P \in \mathcal{P} \setminus \mathcal{P}_t$, therefore $f = (f_{-t}, f_t)$. Similarly, $R_{-t}$ is the vector of rates corresponding to
all players other than $t$, therefore $R = (R_{-t}, R_t)$. In our setting, the best response problem of a terminal $t$
is to minimize her cost function $C^{(t)}(f_{-t}, f_t, R_{-t}, R_t)$ over $(f_t, R_t) \in A_t$ given any $(f_{-t}, R_{-t})$. Therefore
a Nash flow-rate is defined as follows.

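Definition 1 (Nash flow-rate) A flow-rate $(f, R)$ feasible for the instance $(G, c, d, \mathcal{R}_{SW})$ is at Nash
equilibrium, or is a Nash flow-rate for instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$, if $\forall t \in T$,

$$C^{(t)}(f, R) \le C^{(t)}(f_{-t}, \tilde{f}_t, R_{-t}, \tilde{R}_t) \quad \forall (\tilde{f}_t, \tilde{R}_t) \in A_t.$$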

We note that computing the best response will in general require a given terminal to know flow assignments
on all possible paths and rate vectors for all the terminals. Moreover, convexity of the objective function
in NIF-CP (i.e. social cost $C(f, R)$) does not imply convexity of $C^{(t)}(f_{-t}, f_t, R_{-t}, R_t)$ in the variables
$(f_t, R_t) \in A_t$ in general. Therefore the computational requirements at the terminals may be large.
Consequently, Nash equilibrium does not seem to be an appropriate solution concept for the Distributed
Compression Game when viewed through the algorithmic lens.

b) Wardrop Equilibrium. From a practical standpoint, a terminal may only have partial knowledge 
of the system and may be computationally constrained. A solution concept more appropriate under such 
situations is that of local Nash equilibrium or Wardrop equilibrium that is widely adopted in selfish routing 
and transportation literature Il25l [3ll9l. We note that this solution concept has also been utilized in |5] and 
is further justified in [llj . We first present the precise definition of the Wardrop equilibrium in our case and 
then provide an intuitive justification. Towards this end, we need to define the marginal cost of a path. 

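Definition 2 (Marginal Cost of a Path) For a $P \in \mathcal{P}_t$ its marginal cost is

$$C_P(f) = \sum_{e \in P} \frac{c_e(z_e)\Psi_{e,t}(x_e)}{x_{e,t}}.$$

Therefore, for the terminal $t$, the total cost for the edges, $C_E^{(t)}(f)$, can be equivalently written as

$$C_E^{(t)}(f) = \sum_{P \in \mathcal{P}_t} C_P(f) f_P.$$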
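
The identity between a terminal's total edge cost and the flow-weighted sum of marginal path costs can be checked numerically. The sketch below assumes that the marginal cost of a path is $\sum_{e \in P} c_e(z_e)\Psi_{e,t}(x_e)/x_{e,t}$, which is the form under which the identity holds term by term; the proportional split, the quadratic cost, the small network and every identifier are hypothetical choices made for this example.

```python
def edge_cost(z):
    """Hypothetical edge cost c_e(z) = z^2, identical on every edge."""
    return z ** 2


def share(x_e, t):
    """Hypothetical proportional split of an edge's cost (assumption, not from the text)."""
    return x_e[t] / sum(x_e.values())


def edge_loads(paths, t, other):
    """Per-edge, per-terminal loads x_{e,t}: other terminals' loads plus t's path flows."""
    x = {e: dict(flows) for e, flows in other.items()}
    for edges, flow in paths.values():
        for e in edges:
            x[e][t] = x[e].get(t, 0.0) + flow
    return x


def marginal_path_cost(edges, t, x):
    """Assumed form: sum over e in P of c_e(z_e) * share_{e,t} / x_{e,t}."""
    return sum(edge_cost(max(x[e].values())) * share(x[e], t) / x[e][t] for e in edges)


def total_edge_cost(t, x):
    """Total edge cost borne by t: sum over e of c_e(z_e) * share_{e,t}."""
    return sum(edge_cost(max(x_e.values())) * share(x_e, t) for x_e in x.values() if t in x_e)


# Terminal t1 uses two paths; terminal t2's loads on the same edges are given.
paths_t1 = {"P1": (["a", "b"], 2.0), "P2": (["a", "c"], 1.0)}
other = {"a": {"t2": 1.0}, "b": {"t2": 0.0}, "c": {"t2": 4.0}}

x = edge_loads(paths_t1, "t1", other)
lhs = total_edge_cost("t1", x)
rhs = sum(marginal_path_cost(edges, "t1", x) * flow for edges, flow in paths_t1.values())
print(lhs, rhs)
```

Regrouping the right-hand side by edge recovers $\sum_e c_e(z_e)\Psi_{e,t}(x_e)$, since the flows of all paths of $t$ through $e$ add up to $x_{e,t}$.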



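Definition 3 (Wardrop flow-rate) A flow-rate $(f, R)$ feasible for the instance $(G, c, d, \mathcal{R}_{SW})$ is at local
Nash equilibrium, or is a Wardrop flow-rate for instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$, if it satisfies the following
conditions.

1. $\forall t \in T$, $\forall s \in S$, we have

$$\sum_{P \in \mathcal{P}_{s,t}} f_P = R_{s,t}.$$

2. $\forall t \in T$, we have

$$\sum_{s \in S} R_{s,t} = H(X_S).$$

3. $\forall t \in T$, $\forall s \in S$, $P, Q \in \mathcal{P}_{s,t}$ with $f_P > 0$,

$$C_P(f) \le C_Q(f).$$

4. For $t \in T$, let $j \in S$ participate in all tight rate inequalities involving $i \in S$ (i.e. if $A \subseteq S$ is such that
$i \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$$^1$, then $j \in A$) and let $P \in \mathcal{P}_{i,t}$, $Q \in \mathcal{P}_{j,t}$ with $f_P > 0$; then we
have

$$C_P(f) + \frac{\partial C_S^{(t)}(R)}{\partial R_{i,t}} \le C_Q(f) + \frac{\partial C_S^{(t)}(R)}{\partial R_{j,t}}.$$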

Intuitively, conditions (1) and (2) require that each terminal requests as little rate and flow as possible.
Condition (3) ensures that an infinitesimally small change in flow allocations from path $P$ (where $f_P > 0$) to
path $Q$, where $P, Q \in \mathcal{P}_{s,t}$, will increase the sum cost along paths in $\mathcal{P}_t$. Now, consider an infinitesimally small
change in flow allocation from $P \in \mathcal{P}_{i,t}$ (where $f_P > 0$) to $Q \in \mathcal{P}_{j,t}$. This also requires a corresponding
change in the rates requested from sources $i$ and $j$ by terminal $t$. Under certain constraints on the source
$j$, Condition (4) ensures that the overall effect of this change will serve to increase terminal $t$'s cost. The
conditions on the source $j$ are well-motivated in light of the characterization of Nash flow-rate in Section 5
in the case when the best response problem of every terminal is convex.

Footnote 1: We use $H(X_A | X_{-A})$ and $H(X_A | X_{A^c})$ interchangeably in the text to denote the joint entropy of the
sources in set $A$ given the remaining sources.

We remark that a Nash flow-rate may not always be a Wardrop flow-rate and vice versa. When sources
are independent, condition (2) implies that $R_{s,t} = H(X_s)$ for all $s \in S$, $t \in T$ and it is not required to
check the condition (4). Also we can recover condition (3) by setting $i = j$ in condition (4). They are stated
separately for the sake of clarity.

As we discussed earlier, the solution concept based on Wardrop equilibrium seems more suitable to our
scenario and consequently we define the price of anarchy [15, 22, 25] in terms of Wardrop flow-rate instead
of Nash flow-rate.

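Definition 4 (Price of Anarchy (POA)) Let $\mathcal{C}$ be a class of edge cost functions, $\mathcal{D}$ be a class of source
cost functions, $\mathcal{G}$ be a class of networks/graphs, $\Psi$ be an edge cost splitting mechanism, $\Phi$ be a source
cost splitting mechanism, and $\mathcal{M}$ be a set of Slepian-Wolf polytopes. We will refer to $(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M})$
as a scenario. The price of anarchy for the scenario $(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M})$, denoted $\rho(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M})$, is
defined as the maximum over all instances $(G, c, d, \mathcal{R}_{SW})$ with $G \in \mathcal{G}$, $c \in \mathcal{C}$, $d \in \mathcal{D}$, $\mathcal{R}_{SW} \in \mathcal{M}$, of the ratio
between the cost of the worst possible Wardrop flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ and the cost of
the OPT flow-rate (i.e. the socially optimal cost) for the instance $(G, c, d, \mathcal{R}_{SW})$. That is,

$$\rho(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M}) = \max_{G \in \mathcal{G},\, c \in \mathcal{C},\, d \in \mathcal{D},\, \mathcal{R}_{SW} \in \mathcal{M}} \left( \max_{(f,R) \text{ is a Wardrop flow-rate for } (G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)} \frac{C(f, R)}{C_{OPT}(G, c, d, \mathcal{R}_{SW})} \right)$$

where $C_{OPT}(G, c, d, \mathcal{R}_{SW})$ refers to the optimal cost of NIF-CP for the instance $(G, c, d, \mathcal{R}_{SW})$.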

Let us denote the set of Slepian-Wolf polytopes corresponding to the case where there are no source
correlations (i.e. $H(X_A | X_{-A}) = H(X_A)$ for all $A \subseteq S$) by $\mathcal{M}_{ind}$ (subscript ind denotes independent)
and the set of Slepian-Wolf polytopes corresponding to the case where sources are correlated (i.e. there
exists $A \subset S$ with $H(X_A | X_{-A}) < H(X_A)$) by $\mathcal{M}_c$. Also, we use $\mathcal{G}_{all}$ to denote the class of all graphs
where every $t \in T$ is connected to every $s \in S$, and $\mathcal{G}_{dsw}$ (subscript dsw denotes direct Slepian-Wolf)
to denote the class of complete bipartite graphs between the set of sources and the set of terminals.
Note that $\mathcal{G}_{dsw}$ corresponds to the case where every terminal is directly connected to every source by
an edge and no network coding is required. A question we will be most concerned with in this work is
whether $\rho(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M}_c) > \rho(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M}_{ind})$, and in particular whether $\rho(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M}_c) >
1$ but $\rho(\mathcal{G}, \mathcal{C}, \mathcal{D}, \Psi, \Phi, \mathcal{M}_{ind}) = 1$ for meaningful classes of cost functions $\mathcal{C}$, $\mathcal{D}$ and reasonable splitting
mechanisms $\Psi$ and $\Phi$, i.e. does correlation induce anarchy?

3 Some Properties of Slepian-Wolf Polytope 

In this section, we establish two properties of the Slepian-Wolf polytope that will be useful in the later sections.

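Lemma 5 Let $R_t \in \mathcal{R}_{SW}$, i.e. $\sum_{i \in A} R_{i,t} \ge H(X_A | X_{-A})$ for all $A \subseteq S$. If $S_1, S_2 \subseteq S$ satisfy

$$\sum_{l \in S_1} R_{l,t} = H(X_{S_1} | X_{-S_1})$$

and

$$\sum_{l \in S_2} R_{l,t} = H(X_{S_2} | X_{-S_2})$$

then we have

$$\sum_{l \in S_1 \cap S_2} R_{l,t} = H(X_{S_1 \cap S_2} | X_{-(S_1 \cap S_2)})$$

and

$$\sum_{l \in S_1 \cup S_2} R_{l,t} = H(X_{S_1 \cup S_2} | X_{-(S_1 \cup S_2)}).$$

Proof: We have,

$$\begin{aligned}
\sum_{l \in S_1 \cap S_2} R_{l,t} + \sum_{l \in S_1 \cup S_2} R_{l,t} &= \sum_{l \in S_1} R_{l,t} + \sum_{l \in S_2} R_{l,t} \\
&= H(X_{S_1} | X_{-S_1}) + H(X_{S_2} | X_{-S_2}) \\
&\le H(X_{S_1 \cap S_2} | X_{-(S_1 \cap S_2)}) + H(X_{S_1 \cup S_2} | X_{-(S_1 \cup S_2)})
\end{aligned}$$

where in the last step we have used the supermodularity property of conditional entropy. Now we are
also given that

$$\sum_{l \in S_1 \cap S_2} R_{l,t} \ge H(X_{S_1 \cap S_2} | X_{-(S_1 \cap S_2)})$$

and

$$\sum_{l \in S_1 \cup S_2} R_{l,t} \ge H(X_{S_1 \cup S_2} | X_{-(S_1 \cup S_2)}).$$

Therefore we can conclude that

$$\sum_{l \in S_1 \cup S_2} R_{l,t} = H(X_{S_1 \cup S_2} | X_{-(S_1 \cup S_2)})$$

and

$$\sum_{l \in S_1 \cap S_2} R_{l,t} = H(X_{S_1 \cap S_2} | X_{-(S_1 \cap S_2)}).$$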
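
The supermodularity used in the proof above, namely that $g(A) = H(X_A | X_{-A})$ satisfies $g(A) + g(B) \le g(A \cup B) + g(A \cap B)$, can be sanity-checked numerically. The sketch below does this for a randomly generated joint pmf of three binary sources; the helper names, the seed and the choice of three sources are assumptions for illustration, and a numerical check on one distribution is of course not a proof.

```python
import random
from itertools import chain, combinations, product
from math import log2


def entropy(pmf):
    """Shannon entropy (in bits) of a pmf given as {outcome: probability}."""
    return -sum(p * log2(p) for p in pmf.values() if p > 0)


def marginal(joint, keep):
    """Marginal pmf of the coordinates listed in `keep`."""
    out = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, 0.0) + p
    return out


def cond_entropy(joint, subset, n):
    """g(A) = H(X_A | X_{A^c}) = H(X_1, ..., X_n) - H(X_{A^c})."""
    rest = tuple(i for i in range(n) if i not in subset)
    return entropy(joint) - entropy(marginal(joint, rest))


def subsets(n):
    """All subsets of {0, ..., n-1}, including the empty set."""
    return chain.from_iterable(combinations(range(n), k) for k in range(n + 1))


def is_supermodular(joint, n, tol=1e-9):
    """Check g(A) + g(B) <= g(A | B) + g(A & B) for all pairs of subsets."""
    for a in subsets(n):
        for b in subsets(n):
            union = tuple(sorted(set(a) | set(b)))
            inter = tuple(sorted(set(a) & set(b)))
            lhs = cond_entropy(joint, a, n) + cond_entropy(joint, b, n)
            rhs = cond_entropy(joint, union, n) + cond_entropy(joint, inter, n)
            if lhs > rhs + tol:
                return False
    return True


# Random joint pmf over three binary sources (fixed seed for repeatability).
random.seed(0)
weights = [random.random() for _ in range(8)]
total = sum(weights)
joint = {o: w / total for o, w in zip(product((0, 1), repeat=3), weights)}

print(is_supermodular(joint, 3))
```

The check relies on writing $H(X_A | X_{-A}) = H(X_S) - H(X_{-A})$, so supermodularity of $g$ follows from submodularity of the joint entropy applied to the complements.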

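Theorem 6 Consider a vector $(R_1, R_2, \ldots, R_n)$ such that

$$\sum_{i \in A} R_i \ge H(X_A | X_{A^c}), \text{ for all } A \subset \{1, 2, \ldots, n\}, \text{ and}$$

$$\sum_{i=1}^{n} R_i > H(X_1, X_2, \ldots, X_n).$$

Then there exists another vector $(R'_1, R'_2, \ldots, R'_n)$ such that $R'_i \le R_i$ for all $i = 1, 2, \ldots, n$ and

$$\sum_{i \in A} R'_i \ge H(X_A | X_{A^c}), \text{ for all } A \subset \{1, 2, \ldots, n\}, \text{ and}$$

$$\sum_{i=1}^{n} R'_i = H(X_1, X_2, \ldots, X_n).$$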

Proof: We claim that there exists an $R_{j^*} \in \{R_1, R_2, \ldots, R_n\}$ such that all inequalities in which $R_{j^*}$
participates are loose. The proof of this claim follows.

Suppose that the above claim is not true. Then for all $R_i$ where $i \in \{1, 2, \ldots, n\}$, there exists at least
one subset $S_i \subseteq \{1, 2, \ldots, n\}$ with $i \in S_i$ such that

$$\sum_{k \in S_i} R_k = H(X_{S_i} | X_{S_i^c}),$$

i.e. each $R_i$ participates in at least one inequality that is tight.

Now by applying Lemma 5 on the sets $S_1, S_2, \ldots, S_n$, since $S_1 \cup S_2 \cup \cdots \cup S_n = \{1, 2, \ldots, n\}$, we
get $\sum_{i=1}^{n} R_i = \sum_{i \in S_1 \cup S_2 \cup \cdots \cup S_n} R_i = H(X_{S_1 \cup S_2 \cup \cdots \cup S_n} | X_{-(S_1 \cup S_2 \cup \cdots \cup S_n)}) = H(X_1, X_2, \ldots, X_n)$, which is a
contradiction.

The above argument shows that there exists some $j^*$ such that all inequalities in which $R_{j^*}$ participates
are loose. Therefore we can reduce $R_{j^*}$ to a new value $R_{j^*}^{new}$ until one of the inequalities in which it
participates is tight. If the sum-rate constraint is met with equality then we can set $R'_{j^*} = R_{j^*}^{new}$; otherwise
we can recursively apply the above procedure to arrive at a new vector that is component-wise smaller than
the original vector $(R_1, R_2, \ldots, R_n)$.
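
The reduction argument above suggests a simple procedure, sketched here as a single-pass variant rather than the exact recursion of the proof: visit each coordinate once and lower it by the smallest slack among the inequalities it participates in. After the pass every coordinate sits in some tight inequality, so by Lemma 5 the sum-rate constraint is tight as well. The function names and the two-source example are assumptions for illustration, and the input is assumed to be feasible.

```python
from itertools import combinations
from math import log2


def entropy(pmf):
    """Shannon entropy (in bits) of a pmf given as {outcome: probability}."""
    return -sum(p * log2(p) for p in pmf.values() if p > 0)


def marginal(joint, keep):
    """Marginal pmf of the coordinates listed in `keep`."""
    out = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in keep)
        out[key] = out.get(key, 0.0) + p
    return out


def cond_entropy(joint, subset, n):
    """H(X_A | X_{A^c}) computed as H(X_1, ..., X_n) - H(X_{A^c})."""
    rest = tuple(i for i in range(n) if i not in subset)
    return entropy(joint) - entropy(marginal(joint, rest))


def reduce_to_sum_rate(rates, joint, tol=1e-12):
    """Lower a feasible rate vector component-wise until the sum-rate is tight."""
    n = len(rates)
    reduced = list(rates)
    for j in range(n):
        # Smallest slack over all inequalities in which coordinate j participates.
        slack = min(
            sum(reduced[i] for i in subset) - cond_entropy(joint, subset, n)
            for k in range(1, n + 1)
            for subset in combinations(range(n), k)
            if j in subset
        )
        if slack > tol:
            reduced[j] -= slack  # makes at least one inequality containing j tight
    return reduced


# Hypothetical pair of correlated binary sources (illustrative numbers).
joint = {(0, 0): 0.45, (0, 1): 0.05, (1, 0): 0.05, (1, 1): 0.45}
original = [1.0, 1.0]
reduced = reduce_to_sum_rate(original, joint)
print(reduced, sum(reduced), entropy(joint))
```

The order in which coordinates are visited determines which point of the sum-rate face is reached; the theorem only asserts that some such point exists below the original vector.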



4 Characterizing the Optimal Flows and Rates 

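In this section, we investigate the properties of an OPT flow-rate via Lagrangian duality theory [6]. Since
the optimization problem (NIF-CP) is convex and the constraints are such that strong duality holds,
the Karush-Kuhn-Tucker (KKT) conditions exactly characterize optimality [6]. Therefore, we start out by
writing the Lagrangian of NIF-CP,

$$L = \sum_{e \in E} c_e(z_e) + \sum_{s \in S} d_s(y_s) - \sum_{P \in \mathcal{P}} \mu_P f_P + \sum_{s \in S} \sum_{t \in T} \lambda_{s,t} \Big( R_{s,t} - \sum_{P \in \mathcal{P}_{s,t}} f_P \Big) + \sum_{t \in T} \sum_{A \subseteq S} \nu_{A,t} \Big( H(X_A | X_{A^c}) - \sum_{i \in A} R_{i,t} \Big) \qquad (2)$$

where $\mu_P \ge 0$, $\lambda_{s,t} \ge 0$ and $\nu_{A,t} \ge 0$ are the dual variables (i.e. Lagrange multipliers). For notational
simplicity, let us denote the partial derivative of $z_e$ with respect to $x_{e,t}$ by $z'_{e,t}$. Note that the partial
derivative of $x_{e,t}$ w.r.t. $f_P$ is 1 for a $P \in \mathcal{P}_t$ passing through $e$. Similarly, we denote the partial derivative of $y_s$ with
respect to $R_{s,t}$ by $y'_{s,t}$. The KKT conditions are then given by the following equations that hold
$\forall s \in S$, $t \in T$,

$$\frac{\partial L}{\partial f_P} = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) - \mu_P - \lambda_{s,t} = 0, \quad \forall P \in \mathcal{P}_{s,t}, \text{ and} \qquad (3)$$

$$\frac{\partial L}{\partial R_{s,t}} = d'_s(y_s) y'_{s,t}(\rho_s) + \lambda_{s,t} - \sum_{A \subseteq S : s \in A} \nu_{A,t} = 0 \qquad (4)$$

along with the feasibility of the flow-rate $(f, R)$ and the complementary slackness conditions, $\mu_P f_P = 0$
for all $P \in \mathcal{P}$, $\lambda_{s,t} (R_{s,t} - \sum_{P \in \mathcal{P}_{s,t}} f_P) = 0$ for all $s \in S$, $t \in T$, and $\nu_{A,t} (H(X_A | X_{A^c}) - \sum_{i \in A} R_{i,t}) = 0$
for all $A \subseteq S$, $t \in T$.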

Let us now interpret the KKT conditions at the OPT flow-rate $(f^*, R^*)$. Suppose that $f^*_P > 0$ for
$P \in \mathcal{P}_{s,t}$. Then due to complementary slackness, we have $\mu^*_P = 0$ and consequently from equation (3)
we get $\sum_{e \in P} c'_e(z^*_e) z'_{e,t}(x^*_e) = \lambda^*_{s,t}$, i.e. if there exists another path $Q \in \mathcal{P}_{s,t}$ such that $f^*_Q > 0$ then
$\sum_{e \in P} c'_e(z^*_e) z'_{e,t}(x^*_e) = \sum_{e \in Q} c'_e(z^*_e) z'_{e,t}(x^*_e)$.
Now if we interpret the quantity $\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e)$ as the differential cost of the path $P$ associated
with the flow-rate $(f, R)$ then this condition implies that the differential cost of all the paths going from the
same source to the same terminal with positive flows at OPT is the same. This is quite intuitive, for if it were
not true the objective function could be further decreased by moving some flow from a higher differential
cost path to a lower differential cost one without violating feasibility conditions, and of course this should
not be possible at the optimum. Similarly, a path with zero flow at OPT must have a differential cost
at least as high, and indeed this can be obtained as above by further noting that the dual variables
$\mu_P$'s are non-negative. We note this property of the OPT flow-rate in the following lemma.

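Lemma 7 Let $(f^*, R^*)$ be an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$. Then, $\forall t \in T$, $\forall s \in S$,
$P, Q \in \mathcal{P}_{s,t}$ with $f^*_P > 0$ we have

$$\sum_{e \in P} c'_e(z^*_e) z'_{e,t}(x^*_e) \le \sum_{e \in Q} c'_e(z^*_e) z'_{e,t}(x^*_e).$$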
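
A toy instance illustrates the equal-differential-cost property. The sketch below assumes a single source and a single terminal joined by two parallel edges with hypothetical costs $c_1(x) = x^2$ and $c_2(x) = 2x^2$ and a fixed demand of 3 units; with one terminal, $z_e = x_e$ and $z'_{e,t} = 1$, so the differential cost of each single-edge path is just $c'_e(x_e)$. The minimizer is found with a plain ternary search, which is an arbitrary choice made for this example.

```python
def total_cost(x1, demand=3.0):
    """Social cost when x1 units use edge 1 and the rest of the demand uses edge 2."""
    x2 = demand - x1
    return x1 ** 2 + 2.0 * x2 ** 2


def minimize_convex(fn, lo, hi, iters=200):
    """Ternary search for the minimizer of a convex function on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if fn(m1) <= fn(m2):
            hi = m2
        else:
            lo = m1
    return (lo + hi) / 2


x1 = minimize_convex(total_cost, 0.0, 3.0)
x2 = 3.0 - x1

# Differential costs c_1'(x1) = 2 * x1 and c_2'(x2) = 4 * x2 of the two paths.
print(x1, x2, 2.0 * x1, 4.0 * x2)
```

Both paths carry positive flow at the optimum, so the lemma applied in both directions forces their differential costs to coincide; solving $2x_1 = 4x_2$ with $x_1 + x_2 = 3$ by hand gives the same split.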

The above lemma provides a simple and intuitive characterization of how the flow allocations on various
paths of the same type (that is, originating at the same source and ending at the same terminal) behave at the
optimum solution. Although such a simple and intuitive characterization of the behavior of joint flow and
rate allocations at the optimum is not immediately clear, we can indeed obtain three other simple and intuitive
conditions that together with Lemma 7 are equivalent to the KKT conditions. We establish this important
characterization in Theorem 11. First, we will show in the next three lemmas that these conditions are
necessary for optimality.

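Lemma 8 Let $(f, R)$ be an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$. For $t \in T$, suppose that there exist $i, j \in S$ that satisfy the following property: if $A \subseteq S$ is such that $i \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$, then $j \in A$. For such $i$ and $j$ let $P \in \mathcal{P}_{i,t}$, $Q \in \mathcal{P}_{j,t}$ with $f_P > 0$. Then

$$\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) + d'_i(y_i) y'_{i,t}(\rho_i) \le \sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e) + d'_j(y_j) y'_{j,t}(\rho_j).$$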

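Proof: Since $(f, R)$ is an OPT flow-rate, it satisfies the KKT conditions for some suitable choice of dual variables $\lambda_{i,t} \ge 0$, $\mu_P \ge 0$, $\nu_{A,t} \ge 0$. Now, we are given that $j \in A$ for all $A \subseteq S$ such that $i \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$, so if there is an $A \subseteq S$ such that $i \in A$ but $j \notin A$ then $\sum_{l \in A} R_{l,t} > H(X_A | X_{-A})$ and therefore by complementary slackness we get $\nu_{A,t} = 0$. Further, from Equation (4) we have

$$d'_i(y_i) y'_{i,t}(\rho_i) + \lambda_{i,t} = \sum_{A \subseteq S : i \in A} \nu_{A,t} = \sum_{A \subseteq S : i \in A, j \in A} \nu_{A,t} + \sum_{A \subseteq S : i \in A, j \notin A} \nu_{A,t} = \sum_{A \subseteq S : i \in A, j \in A} \nu_{A,t} \quad \Big(\text{since } \sum_{A \subseteq S : i \in A, j \notin A} \nu_{A,t} = 0\Big)$$

and

$$d'_j(y_j) y'_{j,t}(\rho_j) + \lambda_{j,t} = \sum_{A \subseteq S : j \in A} \nu_{A,t} = \sum_{A \subseteq S : j \in A, i \in A} \nu_{A,t} + \sum_{A \subseteq S : j \in A, i \notin A} \nu_{A,t} \ge \sum_{A \subseteq S : j \in A, i \in A} \nu_{A,t} = d'_i(y_i) y'_{i,t}(\rho_i) + \lambda_{i,t}.$$

Therefore we get

$$d'_i(y_i) y'_{i,t}(\rho_i) + \lambda_{i,t} \le d'_j(y_j) y'_{j,t}(\rho_j) + \lambda_{j,t}.$$

Furthermore, we are given that $f_P > 0$ which, using Equation (3) and the complementary slackness condition $f_P \mu_P = 0$, implies that $\lambda_{i,t} = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e)$, and since $\mu_Q \ge 0$ we have $\sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e) \ge \lambda_{j,t}$. Therefore,

$$\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) + d'_i(y_i) y'_{i,t}(\rho_i) \le \sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e) + d'_j(y_j) y'_{j,t}(\rho_j).$$

This concludes the proof.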

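Lemma 9 Let $(f, R)$ be an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$ wherein the functions $c_e$ and $d_s$ are all strictly convex. Then $\forall t \in T$, $\forall s \in S$, we have $\sum_{P \in \mathcal{P}_{s,t}} f_P = R_{s,t}$.

Proof: Suppose $\sum_{P \in \mathcal{P}_{s,t}} f_P > R_{s,t}$; then there is a $P \in \mathcal{P}_{s,t}$ with $f_P > 0$. Define a new feasible flow $\tilde{f}$ such that $\tilde{f}_Q = f_Q$ if $Q \ne P$ and $\tilde{f}_P = f_P - \delta$ for some $0 < \delta < \min\{f_P, \sum_{P \in \mathcal{P}_{s,t}} f_P - R_{s,t}\}$. Then,

$$\sum_{e \in E} c_e(\tilde{z}_e) = \sum_{e \notin P} c_e(z_e) + \sum_{e \in P} c_e(\tilde{z}_e) = \sum_{e \in E} c_e(z_e) + \sum_{e \in P} \big(c_e(\tilde{z}_e) - c_e(z_e)\big).$$

Now, since each function $c_e$ is non-decreasing and $z_e$ is non-decreasing in each co-ordinate, we get $c_e(\tilde{z}_e) - c_e(z_e) \le 0$ for all $e \in P$. Therefore,

$$\sum_{e \in E} c_e(\tilde{z}_e) \le \sum_{e \in E} c_e(z_e) \;\Rightarrow\; C(\tilde{f}, R) = \sum_{e \in E} c_e(\tilde{z}_e) + \sum_{s \in S} d_s(y_s) \le \sum_{e \in E} c_e(z_e) + \sum_{s \in S} d_s(y_s) = C(f, R),$$

which is a contradiction because $(f, R)$, due to strict convexity of the function $C$, is the unique OPT flow-rate.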

Lemma 10 Let $(f, R)$ be an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$ wherein the functions $c_e$ and $d_s$ are all strictly convex. Then $\forall t \in T$, we have $\sum_{s \in S} R_{s,t} = H(X_S)$.

Proof: As $R$ is feasible, $\forall t \in T$, $R_t \in \mathcal{R}_{SW}$ and therefore $\sum_{s \in S} R_{s,t} \ge H(X_S)$. Suppose $\sum_{s \in S} R_{s,t} > H(X_S)$ for some $t \in T$; then from Theorem 6 there exists an $s \in S$ such that all (Slepian-Wolf) inequalities in which $R_{s,t}$ participates are loose. Therefore, we can decrease this rate $R_{s,t}$ by a positive amount $\tau$ without violating feasibility. This means that we can define a feasible rate $\tilde{R}$ such that $\tilde{R}_{i,t} = R_{i,t}$ if $i \ne s$ and $\tilde{R}_{s,t} = R_{s,t} - \tau$ for some $\tau > 0$. Now,

$$\sum_{i \in S} d_i(\tilde{y}_i) = \sum_{i \in S} d_i(y_i) + \big(d_s(\tilde{y}_s) - d_s(y_s)\big).$$

Since $d_s$ is non-decreasing and $y_s$ is non-decreasing in each co-ordinate, we get $d_s(\tilde{y}_s) \le d_s(y_s)$. Therefore,

$$\sum_{i \in S} d_i(\tilde{y}_i) \le \sum_{i \in S} d_i(y_i) \;\Rightarrow\; C(f, \tilde{R}) = \sum_{e \in E} c_e(z_e) + \sum_{s \in S} d_s(\tilde{y}_s) \le \sum_{e \in E} c_e(z_e) + \sum_{s \in S} d_s(y_s) = C(f, R),$$

which is a contradiction because $(f, R)$, due to strict convexity of the function $C$, is the unique OPT flow-rate.
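The role of tight and loose Slepian-Wolf inequalities in this argument can be made concrete with a small sketch. It assumes the conditional entropies $H(X_A | X_{-A})$ are supplied as a dictionary indexed by subsets (the function names and the data layout are hypothetical, not from the paper), and lists the sources whose rate could still be lowered because every inequality they participate in is loose.

```python
def slack(rates, cond_entropy):
    """Slack sum_{l in A} R_l - H(X_A | X_{-A}) of every Slepian-Wolf inequality.
    `cond_entropy` maps frozenset(A) -> H(X_A | X_{-A}) (assumed given)."""
    return {A: sum(rates[s] for s in A) - cond_entropy[A] for A in cond_entropy}


def reducible_sources(rates, cond_entropy, tol=1e-12):
    """Sources all of whose inequalities are loose, so that the rate can be
    decreased by a small positive amount without leaving the region."""
    sl = slack(rates, cond_entropy)
    if any(v < -tol for v in sl.values()):
        raise ValueError("rate vector is outside the Slepian-Wolf region")
    return [s for s in rates if all(v > tol for A, v in sl.items() if s in A)]


# Two identical sources with one bit of entropy:
# H(X1 | X2) = H(X2 | X1) = 0 and H(X1, X2) = 1.
H = {frozenset({1}): 0.0, frozenset({2}): 0.0, frozenset({1, 2}): 1.0}

loose_case = reducible_sources({1: 0.7, 2: 0.5}, H)   # sum rate 1.2 exceeds 1
tight_case = reducible_sources({1: 0.6, 2: 0.4}, H)   # sum rate equals 1
print(loose_case, tight_case)
```

In the first call the sum-rate inequality is loose and both sources can still lower their rates; in the second call the sum-rate inequality is tight, so no source is reducible.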

Theorem 11 A feasible flow-rate $(f, R)$ for the instance $(G, c, d, \mathcal{R}_{SW})$ which satisfies the following four conditions is an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$. Also, there is always an OPT flow-rate that satisfies these four conditions. Further, when the edge cost functions $c_e$ for all $e \in E$ and the source cost functions $d_s$ for all $s \in S$ are strictly convex, that is, when the optimization problem (NIF-CP) is strictly convex, these conditions are also necessary for optimality.

1. $\forall t \in T$, $\forall s \in S$, we have
$$\sum_{P \in \mathcal{P}_{s,t}} f_P = R_{s,t}.$$

2. $\forall t \in T$, we have
$$\sum_{s \in S} R_{s,t} = H(X_S).$$

3. $\forall t \in T$, $\forall s \in S$, $P, Q \in \mathcal{P}_{s,t}$ with $f_P > 0$,
$$\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) \le \sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e).$$

4. For $t \in T$, suppose that there exist $i, j \in S$ that satisfy the following property: if $A \subseteq S$ is such that $i \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$, then $j \in A$. For such $i$ and $j$ let $P \in \mathcal{P}_{i,t}$, $Q \in \mathcal{P}_{j,t}$ with $f_P > 0$. Then
$$\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) + d'_i(y_i) y'_{i,t}(\rho_i) \le \sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e) + d'_j(y_j) y'_{j,t}(\rho_j).$$

Proof: We prove that the above four conditions imply optimality of $(f, R)$. Our assumptions guarantee that the optimization problem (NIF-CP) for the instance $(G, c, d, \mathcal{R}_{SW})$ is convex and since all the feasibility constraints are linear, strong duality holds [6]. This implies that the KKT conditions are necessary and sufficient for optimality. We show that a feasible flow-rate $(f, R)$ with the above four properties satisfies the KKT conditions for the instance $(G, c, d, \mathcal{R}_{SW})$ for a suitable choice of the dual variables given below.

Choosing $\lambda_{i,t}$'s:

$$\lambda_{i,t} = \min_{P \in \mathcal{P}_{i,t}} \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e).$$

Note that, using Condition 3, for $i \in S$, if there exists a $P_i \in \mathcal{P}_{i,t}$ such that $f_{P_i} > 0$ then we have

$$\lambda_{i,t} = \sum_{e \in P_i} c'_e(z_e) z'_{e,t}(x_e).$$

Choosing $\mu_P$'s: For $P \in \mathcal{P}_{i,t}$ take

$$\mu_P = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) - \lambda_{i,t}.$$

Choosing $\nu_{A,t}$'s: Let

$$h_{i,t} := d'_i(y_i) y'_{i,t}(\rho_i) + \lambda_{i,t}.$$

Let $\pi$ denote a permutation such that $0 \le h_{\pi(1),t} \le h_{\pi(2),t} \le \cdots \le h_{\pi(N_S),t}$. Now take

$$\nu_{A,t} = \begin{cases} h_{\pi(1),t} & \text{if } A = \{\pi(1), \pi(2), \ldots, \pi(N_S)\}, \\ h_{\pi(i),t} - h_{\pi(i-1),t} & \text{if } A = \{\pi(i), \ldots, \pi(N_S)\} \text{ and } 2 \le i \le N_S, \\ 0 & \text{otherwise.} \end{cases}$$

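Now, with the above choice of dual variables we check all the KKT conditions one by one.

Dual Feasibility:

• $\lambda_{i,t} \ge 0$ as $c_e$ and $z_e$ are non-decreasing functions, i.e. $c'_e(z_e) \ge 0$ and $z'_{e,t}(x_e) \ge 0$.

• $\mu_P \ge 0$ by definition, because $\lambda_{i,t} \le \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e)$ $\forall P \in \mathcal{P}_{i,t}$.

• $\nu_{A,t} \ge 0$ by definition.

KKT Conditions as per Equation (3):

$$\frac{\partial L}{\partial f_P} = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) - \lambda_{i,t} - \mu_P = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) - \lambda_{i,t} - \Big(\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) - \lambda_{i,t}\Big) = 0.$$

KKT Conditions as per Equation (4):

$$\begin{aligned}
\frac{\partial L}{\partial R_{\pi(i),t}} &= d'_{\pi(i)}(y_{\pi(i)}) y'_{\pi(i),t}(\rho_{\pi(i)}) + \lambda_{\pi(i),t} - \sum_{A \subseteq S : \pi(i) \in A} \nu_{A,t} \\
&= h_{\pi(i),t} - \sum_{A \subseteq S : \pi(i) \in A} \nu_{A,t} \\
&= h_{\pi(i),t} - \sum_{j \in \{1, 2, \ldots, i\}} \nu_{\{\pi(j), \pi(j+1), \ldots, \pi(N_S)\},t} \\
&= h_{\pi(i),t} - \big[h_{\pi(1),t} + (h_{\pi(2),t} - h_{\pi(1),t}) + (h_{\pi(3),t} - h_{\pi(2),t}) + \cdots + (h_{\pi(i),t} - h_{\pi(i-1),t})\big] \\
&= h_{\pi(i),t} - h_{\pi(i),t} = 0.
\end{aligned}$$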
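The telescoping choice of the $\nu_{A,t}$ above is easy to verify mechanically. The following sketch, for a single fixed terminal and with hypothetical names (`nested_duals`, `stationarity_residual`) and made-up values of $h_{i,t}$, builds the duals on the nested sets $\{\pi(i), \ldots, \pi(N_S)\}$ and checks that they are non-negative and that $\sum_{A \ni s} \nu_{A,t} = h_{s,t}$ for every source.

```python
def nested_duals(h):
    """h maps source -> h_{s,t} >= 0 for one fixed terminal t.
    Returns a dict frozenset(A) -> nu_{A,t}, non-zero only on the nested sets
    {pi(i), ..., pi(N_S)}, where pi sorts the sources by increasing h."""
    order = sorted(h, key=h.get)          # pi(1), ..., pi(N_S)
    nu, previous = {}, 0.0
    for i, s in enumerate(order):
        nu[frozenset(order[i:])] = h[s] - previous   # telescoping increment
        previous = h[s]
    return nu


def stationarity_residual(h, nu, s):
    """h_{s,t} minus the sum of nu_{A,t} over all sets A containing s."""
    return h[s] - sum(v for A, v in nu.items() if s in A)


h = {"s1": 3.0, "s2": 1.0, "s3": 2.5}     # illustrative values only
nu = nested_duals(h)
residuals = {s: stationarity_residual(h, nu, s) for s in h}
print(nu, residuals)
```

Because only $N_S$ of the $2^{N_S} - 1$ subsets receive a possibly non-zero dual, complementary slackness only has to be checked on these nested sets, which is what the remainder of the proof does.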



Complementary Slackness Conditions:

• $\mu_P f_P = 0$ for all $P \in \mathcal{P}_t$.

Let $P \in \mathcal{P}_{i,t}$ and $f_P > 0$; then using Condition 3 and the definition of $\lambda_{i,t}$ we get

$$\lambda_{i,t} = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e)$$

and therefore,

$$\mu_P = \sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) - \lambda_{i,t} = 0.$$

• $\lambda_{s,t}\big(R_{s,t} - \sum_{P \in \mathcal{P}_{s,t}} f_P\big) = 0$ for all $s \in S$, $t \in T$. This follows from Condition 1.

• $\nu_{A,t}\big(H(X_A | X_{-A}) - \sum_{l \in A} R_{l,t}\big) = 0$ for all $A \subseteq S$, $t \in T$.

Note that $\nu_{A,t} = 0$ except for $A = \{\pi(i), \pi(i+1), \ldots, \pi(N_S)\}$, for $i = 1, 2, \ldots, N_S$. Therefore the only condition that needs to be checked is that if

$$\sum_{j=i}^{N_S} R_{\pi(j),t} > H(X_{\pi(i)}, X_{\pi(i+1)}, \ldots, X_{\pi(N_S)} \,|\, X_{\pi(i-1)}, \ldots, X_{\pi(1)}),$$

then $h_{\pi(i),t} - h_{\pi(i-1),t} = 0$.

Towards this end let $j \in \{\pi(i), \pi(i+1), \ldots, \pi(N_S)\}$, and let $A_j$ be the minimum cardinality set such that $j \in A_j$ and $\sum_{l \in A_j} R_{l,t} = H(X_{A_j} | X_{-A_j})$, i.e.

$$A_j = \arg\min_{A \subseteq S : j \in A, \; \sum_{l \in A} R_{l,t} = H(X_A | X_{-A})} |A|.$$

Such a set $A_j$ always exists because from Condition 2 we have $\sum_{l \in S} R_{l,t} = H(X_1, \ldots, X_{N_S})$ and therefore the set $\{A \subseteq S : j \in A, \sum_{l \in A} R_{l,t} = H(X_A | X_{-A})\}$ is not empty.

We claim that there exists a $j^* \in \{\pi(i), \pi(i+1), \ldots, \pi(N_S)\}$ such that $A_{j^*} \cap \{\pi(1), \pi(2), \ldots, \pi(i-1)\}$ is not empty. If this is not true then clearly we have $\bigcup_{j \in \{\pi(i), \ldots, \pi(N_S)\}} A_j = \{\pi(i), \pi(i+1), \ldots, \pi(N_S)\}$ and using the supermodularity property of conditional entropy (established in an earlier lemma), which makes the union of tight sets tight, we obtain

$$\sum_{j=i}^{N_S} R_{\pi(j),t} = H(X_{\pi(i)}, X_{\pi(i+1)}, \ldots, X_{\pi(N_S)} \,|\, X_{\pi(i-1)}, \ldots, X_{\pi(1)}),$$

which is a contradiction. Therefore we must have such a $j^* \in \{\pi(i), \pi(i+1), \ldots, \pi(N_S)\}$ such that $A_{j^*} \cap \{\pi(1), \pi(2), \ldots, \pi(i-1)\}$ is not empty.

Next, we show that there exists a source $k \in \{\pi(1), \pi(2), \ldots, \pi(i-1)\}$ such that if $j^* \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$, then $k \in A$. Towards this end suppose that there exist subsets $S_1$ and $S_2$ of $S$ such that $j^* \in S_1 \cap S_2$ and $\sum_{l \in S_1} R_{l,t} = H(X_{S_1} | X_{-S_1})$ and $\sum_{l \in S_2} R_{l,t} = H(X_{S_2} | X_{-S_2})$. Then using the supermodularity property of conditional entropy we can show that the rate inequality involving $S_1 \cap S_2$ is also tight, i.e. $\sum_{l \in S_1 \cap S_2} R_{l,t} = H(X_{S_1 \cap S_2} | X_{-(S_1 \cap S_2)})$. This implies that $A_{j^*}$, being of minimum cardinality, is the intersection of all sets that have $j^*$ as a member and on which the rate inequality is tight, i.e.

$$A_{j^*} = \bigcap \Big\{A \subseteq S : j^* \in A, \; \sum_{l \in A} R_{l,t} = H(X_A | X_{-A})\Big\}.$$

Moreover, note that $A_{j^*}$ is not a singleton set since $A_{j^*} \cap \{\pi(1), \pi(2), \ldots, \pi(i-1)\} \ne \emptyset$. Therefore there exists a $k \in A_{j^*} \cap \{\pi(1), \ldots, \pi(i-1)\}$, and in particular $k \ne j^*$. By our above arguments this implies that if $A \subseteq S$ is such that $j^* \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$ then $k \in A$.

Clearly, $R_{j^*,t} > H(X_{j^*} | X_{-j^*})$ as $k$ does not participate in this rate inequality. Therefore $R_{j^*,t} > 0$, which implies that there exists a $P \in \mathcal{P}_{j^*,t}$ with $f_P > 0$; therefore using Condition 3 and the definition of $\lambda_{j^*,t}$ we have $\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) = \lambda_{j^*,t}$. Also, by the definition of $\lambda_{k,t}$ there is a $Q \in \mathcal{P}_{k,t}$ such that $\sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e) = \lambda_{k,t}$. Now using Condition 4, we get

$$\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) + d'_{j^*}(y_{j^*}) y'_{j^*,t}(\rho_{j^*}) \le \sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e) + d'_k(y_k) y'_{k,t}(\rho_k),$$

which implies that

$$\lambda_{j^*,t} + d'_{j^*}(y_{j^*}) y'_{j^*,t}(\rho_{j^*}) \le \lambda_{k,t} + d'_k(y_k) y'_{k,t}(\rho_k)$$

and therefore we get $h_{j^*,t} \le h_{k,t}$. Now note that $k \in \{\pi(1), \pi(2), \ldots, \pi(i-1)\}$ while $j^* \in \{\pi(i), \ldots, \pi(N_S)\}$. This implies in turn that $h_{\pi(i),t} \le h_{j^*,t} \le h_{k,t}$. But we know that $h_{k,t} \le h_{\pi(i-1),t}$, i.e. $h_{\pi(i),t} - h_{\pi(i-1),t} \le 0$, while we already have $h_{\pi(i),t} - h_{\pi(i-1),t} \ge 0$, and hence $h_{\pi(i),t} - h_{\pi(i-1),t} = 0$.

This establishes that the four conditions are sufficient for optimality. Further, as per Lemmas 7, 8, 9 and 10, under strict convexity these conditions are necessary too.

Corollary 12 If the sources are independent (i.e. $\mathcal{R}_{SW} \in \mathcal{M}_{ind}$), there is a feasible flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$ that is an OPT flow-rate for both the instances $(G, c, d, \mathcal{R}_{SW})$ and $(G, \hat{c}, \hat{d}, \mathcal{R}_{SW})$, where $\hat{c}_e(x) = a\, c_e(x)$ for a constant $a > 0$, and $\hat{d}_s$ is any convex, differentiable, positive and non-decreasing function. Further, this OPT flow-rate satisfies the four conditions in Theorem 11 for both the instances $(G, c, d, \mathcal{R}_{SW})$ and $(G, \hat{c}, \hat{d}, \mathcal{R}_{SW})$.

Proof: The idea is that when the sources are independent, Condition (2) in Theorem 11 implies that $R_{s,t} = H(X_s)$ for all $s \in S$, $t \in T$, and therefore there is no pair $(i, j)$ such that $j$ participates in all tight rate inequalities involving $i$; consequently it is not required to check Condition (4). For the sake of completeness the proof follows.

Let $(f, R)$ be an OPT flow-rate for $(G, c, d, \mathcal{R}_{SW})$ satisfying the four conditions in Theorem 11. Note that such an OPT flow-rate always exists as per Theorem 11. Since the sources are independent, the rate inequality constraints become

$$\sum_{i \in A} R_{i,t} \ge H(X_A) \quad \text{for all } A \subseteq S, \; t \in T.$$

Therefore, using Condition (2) in Theorem 11 we obtain

$$R_{s,t} = H(X_s) \quad \text{for all } s \in S, \; t \in T.$$

Now we show that $(f, R)$ is also an OPT flow-rate for the instance $(G, \hat{c}, \hat{d}, \mathcal{R}_{SW})$ by showing that it satisfies the four conditions in Theorem 11 for the instance $(G, \hat{c}, \hat{d}, \mathcal{R}_{SW})$. Note that Conditions (1) and (2) are easily satisfied by $(f, R)$ as they do not depend on the particular cost functions. Further, $\hat{c}'_e(z_e) = a\, c'_e(z_e)$ with $a > 0$, therefore the condition

$$\sum_{e \in P} c'_e(z_e) z'_{e,t}(x_e) \le \sum_{e \in Q} c'_e(z_e) z'_{e,t}(x_e)$$

is equivalent to

$$\sum_{e \in P} \hat{c}'_e(z_e) z'_{e,t}(x_e) \le \sum_{e \in Q} \hat{c}'_e(z_e) z'_{e,t}(x_e),$$

and therefore Condition (3) is also satisfied. For Condition (4), let us first note that, as discussed above, $R_{s,t} = H(X_s)$ for all $s \in S$, $t \in T$. This implies that there is no pair $(i, j) \in S \times S$ satisfying the premise of Condition (4), i.e. there is no pair such that $j$ participates in all tight rate inequalities involving $i$ (simply because $j$ does not participate in the tight rate inequality $R_{i,t} = H(X_i)$). Thus, $(f, R)$ satisfies all four conditions in Theorem 11 for the instance $(G, \hat{c}, \hat{d}, \mathcal{R}_{SW})$ and hence is an OPT flow-rate for $(G, \hat{c}, \hat{d}, \mathcal{R}_{SW})$.

5 The Flows and Rates at Nash Equilibrium 

In this section, we study the properties of a Nash flow-rate whenever the individual optimization problem (i.e. the best response problem) of each terminal is convex, that is, whenever Nash equilibrium can be considered as an appropriate solution concept for the Distributed Compression Game when viewed through the algorithmic lens. Therefore, throughout this section, we assume that the edge cost splitting mechanism $\Psi$, as well as the source cost splitting mechanism $\Phi$, are such that the functions $C^{(t)}$, for all $t \in T$, are convex. By considering the best response problem of each terminal, and an approach essentially the same as in Section 4 for characterizing the OPT flow-rate, we can obtain the following Theorem 13 for characterizing the Nash flow-rate.

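Theorem 13 Consider an instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ where $C^{(t)}$ is convex for all $t \in T$. A feasible flow-rate $(f, R)$ for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ which satisfies the following four conditions is a Nash flow-rate for $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$. Further, when $C^{(t)}$ is strictly convex for all $t \in T$, these conditions are also necessary.

(1) $\forall t \in T$, $\forall s \in S$, we have
$$\sum_{P \in \mathcal{P}_{s,t}} f_P = R_{s,t}.$$

(2) $\forall t \in T$, we have
$$\sum_{s \in S} R_{s,t} = H(X_S).$$

(3) $\forall t \in T$, $\forall s \in S$, $P, Q \in \mathcal{P}_{s,t}$ with $f_P > 0$,
$$\frac{\partial C_E^{(t)}(f)}{\partial f_P} \le \frac{\partial C_E^{(t)}(f)}{\partial f_Q}.$$

(4) For $t \in T$, let $j \in S$ participate in all tight rate inequalities involving $i \in S$ (i.e. if $A \subseteq S$ is such that $i \in A$ and $\sum_{l \in A} R_{l,t} = H(X_A | X_{-A})$, then $j \in A$) and let $P \in \mathcal{P}_{i,t}$, $Q \in \mathcal{P}_{j,t}$ with $f_P > 0$. Then we have
$$\frac{\partial C_E^{(t)}(f)}{\partial f_P} + \frac{\partial C_S^{(t)}(R)}{\partial R_{i,t}} \le \frac{\partial C_E^{(t)}(f)}{\partial f_Q} + \frac{\partial C_S^{(t)}(R)}{\partial R_{j,t}}.$$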

Further, under similar convexity conditions, we can also show that a Nash flow-rate always exists for 
the Distributed Compression Game. This is done via first compactifying the strategy sets At's to obtain a 
restricted game where existence of a Nash equilibrium follows from the standard fixed point theorems fTP\. 
Then, by utilizing the monotonically non-decreasing properties of various cost functions, it is argued that 
a Nash equilibrium of the restricted game is also a Nash flow-rate for our Distributed Compression Game 
thereby proving the existence of a Nash flow-rate for Distributed Compression Game. 

Theorem 14 below is a standard result on the existence of Nash equilibrium; we adopt it from the book by Osborne and Rubinstein.






Theorem 14 The strategic game $\langle N, (A_i), (\succsim_i) \rangle$ has a Nash equilibrium if for all $i \in N$, the following conditions hold.

a) The set $A_i$ of actions of player $i$ is a nonempty compact convex subset of a Euclidean space.

b) The preference relation $\succsim_i$ is continuous and quasi-concave on $A_i$. A preference relation $\succsim_i$ on $A$ is said to be quasi-concave on $A_i$ if for every $a^* \in A$ the set $\{a_i \in A_i : (a^*_{-i}, a_i) \succsim_i a^*\}$ is convex. A preference relation $\succsim_i$ on $A$ is said to be continuous if $a \succsim_i b$ whenever there are sequences $\{a^k\}$ and $\{b^k\}$ with $a^k, b^k \in A$ and $a^k \succsim_i b^k$ for all $k$ such that $\{a^k\}$ and $\{b^k\}$ converge to $a$ and $b$ respectively.

Now, let us consider an instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ of the Distributed Compression Game, where $C^{(t)}$ is convex for all $t \in T$.

The action set of the terminal $t \in T$ is

$$A_t = \Big\{ (f_t, R_t) : \; f_P \ge 0 \;\; \forall P \in \mathcal{P}_t, \quad \sum_{P \in \mathcal{P}_{s,t}} f_P \ge R_{s,t} \;\; \forall s \in S, \quad R_t \in \mathcal{R}_{SW} \Big\}. \qquad (5)$$

Clearly this is a nonempty convex subset of a Euclidean space, but it is not compact.

Let us consider a game with a restricted set of strategies, denoted $\tilde{A}_t$, as follows, and let us call this new game the restricted game for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$:

$$\tilde{A}_t = \Big\{ (f_t, R_t) : \; f_P \ge 0 \;\; \forall P \in \mathcal{P}_t, \quad R_t \in \mathcal{R}_{SW}, \quad f_P \le H(X_S) \;\; \forall P \in \mathcal{P}_t, \quad R_{s,t} \le H(X_S) \;\; \forall s \in S \Big\}. \qquad (6)$$

Now the set $\tilde{A}_t$ is compact, as it is a closed and bounded subset of a Euclidean space, and therefore $\tilde{A}_t$ satisfies requirement (a) of Theorem 14.

Since the players' cost functions $C^{(t)}$ are convex and continuous for all $t \in T$, condition (b) in Theorem 14 is also satisfied and we obtain the following result.

Lemma 15 The restricted game for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$, where $C^{(t)}$ is convex for all $t \in T$, admits a Nash equilibrium.

Now we claim that every Nash equilibrium of the restricted game is also a Nash equilibrium of the original game, which will imply the existence of a Nash flow-rate for the original game.

Lemma 16 Every Nash equilibrium of the restricted game for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$, where $C^{(t)}$ is convex for all $t \in T$, is also a Nash flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$.

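Proof: Let $(f, R)$ be a Nash equilibrium of the restricted game for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$. Then, for all $t$ we have

$$C^{(t)}(f, R) \le C^{(t)}(f_{-t}, R_{-t}, \hat{f}_t, \hat{R}_t)$$

for all $(\hat{f}_t, \hat{R}_t)$ feasible for the restricted game, i.e. coming from the restricted strategy set $\tilde{A}_t$.

Now let $(\hat{f}_t, \hat{R}_t) \in A_t \setminus \tilde{A}_t$, i.e. $(\hat{f}_t, \hat{R}_t)$ is feasible for the original game but not feasible for the restricted game. For ease of notation, let us define the following quantities.

$$S_{1,t} = \{s \in S : \hat{R}_{s,t} > H(X_S)\}, \quad S_{2,t} = S \setminus S_{1,t},$$
$$R'_t = \{R'_{s,t} := H(X_S) \mid s \in S_{1,t}\},$$
$$\mathcal{P}^1_t = \{P \in \mathcal{P}_t : \hat{f}_P > H(X_S)\}, \quad \mathcal{P}^2_t = \mathcal{P}_t \setminus \mathcal{P}^1_t,$$
$$f'_t = \{f'_P := H(X_S) \mid P \in \mathcal{P}^1_t\}.$$

Note that in defining $R'_t$ and $f'_t$ we have projected all the flows and rates violating feasibility for the restricted game to their boundary values, and therefore the strategy $(f'_t, \{\hat{f}_P : P \in \mathcal{P}^2_t\}, R'_t, \{\hat{R}_{s,t} : s \in S_{2,t}\}) \in \tilde{A}_t$, i.e. it is feasible for the restricted game.

Now, since the cost functions are non-decreasing,

$$C^{(t)}(f_{-t}, R_{-t}, \hat{f}_t, \hat{R}_t) \ge C^{(t)}(f_{-t}, R_{-t}, \hat{f}_t, R'_t, \{\hat{R}_{s,t} : s \in S_{2,t}\}) \ge C^{(t)}(f_{-t}, R_{-t}, f'_t, \{\hat{f}_P : P \in \mathcal{P}^2_t\}, R'_t, \{\hat{R}_{s,t} : s \in S_{2,t}\}),$$

and since $(f, R)$ is a Nash equilibrium for the restricted game and $(f'_t, \{\hat{f}_P : P \in \mathcal{P}^2_t\}, R'_t, \{\hat{R}_{s,t} : s \in S_{2,t}\})$ is feasible for the restricted game we have

$$C^{(t)}(f, R) \le C^{(t)}(f_{-t}, R_{-t}, f'_t, \{\hat{f}_P : P \in \mathcal{P}^2_t\}, R'_t, \{\hat{R}_{s,t} : s \in S_{2,t}\}) \le C^{(t)}(f_{-t}, R_{-t}, \hat{f}_t, \hat{R}_t),$$

and therefore $C^{(t)}(f, R) \le C^{(t)}(f_{-t}, R_{-t}, \hat{f}_t, \hat{R}_t)$ for all $(\hat{f}_t, \hat{R}_t) \in A_t$, implying that $(f, R)$ is a Nash equilibrium of the original game, meaning $(f, R)$ is a Nash flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$.

Combining Lemmas 15 and 16 we obtain the following theorem.

Theorem 17 An instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$, where $C^{(t)}$ is convex for all $t \in T$, admits a Nash flow-rate.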

6 Wardrop Flow-Rate and the Price of Anarchy 

In this section, we investigate the inefficiency brought forth by the selfish behavior of terminals. First, we 
will show that the Wardrop equilibrium is a socially optimal solution for a different set of (related) cost 
functions. Using this, we will construct explicit examples that demonstrate that the POA > 1 and determine 
near-tight upper bounds on the POA as well. We start out with the characterization of Wardrop flow-rate. 

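Theorem 18 Let $z_e(x_e) = \big(\sum_{t \in T} x_{e,t}^n\big)^{1/n}$, $\Psi_{e,t}(x_e) = \frac{x_{e,t}^n}{\sum_{t' \in T} x_{e,t'}^n}$ and $\Phi_{s,t}(\rho_s) = \frac{1}{N_T}$. A Wardrop flow-rate for $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ is an OPT flow-rate for $(G, \hat{c}, d, \mathcal{R}_{SW})$, where $\hat{c}_e(x) = N_T \int \frac{c_e(x)}{x}\, dx$. Further, when the edge cost functions $c_e$ for all $e \in E$ and the source cost functions $d_s$ for all $s \in S$ are strictly convex, an OPT flow-rate for $(G, c, d, \mathcal{R}_{SW})$ is also a Wardrop flow-rate for $(G, \tilde{c}, d, \mathcal{R}_{SW}, \Psi, \Phi)$, where $\tilde{c}_e(x) = \frac{1}{N_T}\, x\, c'_e(x)$.

Proof: We will show that the definition of a Wardrop flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ exactly corresponds to the four conditions for the instance $(G, \hat{c}, d, \mathcal{R}_{SW})$ in Theorem 11.

We have

$$\hat{c}'_e(x) = N_T \frac{c_e(x)}{x}.$$

Therefore,

$$C_P(f) = \sum_{e \in P} c_e(z_e) \frac{\Psi_{e,t}(x_e)}{x_{e,t}} = \frac{1}{N_T} \sum_{e \in P} \hat{c}'_e(z_e) z'_{e,t}(x_e),$$

where the last equality follows from the fact that

$$z'_{e,t}(x_e) = \Big(\frac{x_{e,t}}{z_e}\Big)^{n-1} = z_e \frac{\Psi_{e,t}(x_e)}{x_{e,t}}.$$

Also,

$$C_S^{(t)}(R) = \frac{1}{N_T} \sum_{i \in S} d_i(y_i), \qquad \frac{\partial C_S^{(t)}(R)}{\partial R_{i,t}} = \frac{1}{N_T} d'_i(y_i) y'_{i,t}(\rho_i).$$

Therefore,

$$C_P(f) + \frac{\partial C_S^{(t)}(R)}{\partial R_{i,t}} = \frac{1}{N_T} \Big[ \sum_{e \in P} \hat{c}'_e(z_e) z'_{e,t}(x_e) + d'_i(y_i) y'_{i,t}(\rho_i) \Big].$$

The result follows from the equivalence of the conditions coming from Definition 3 and Theorem 11.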

In contrast with the result of [5] that holds for a single source with the edge cost splitting mechanism used above, from Theorem 18 we can note that for most reasonable cost splitting mechanisms, the POA will not equal one for all monomial edge cost functions. We construct explicit examples with POA > 1 in Figures 1 and 2. The example in Figure 1 is near tight, as will be evident from an upper bound on the POA derived in Theorem 20.

It is interesting to note that in the case when the sources are independent, in the Wardrop or OPT solutions, the rates requested at the various sources will equal their respective lower bounds (i.e. their entropies). Therefore, the cost term corresponding to the sources will be fixed, and one only needs to find flows that minimize the edge costs. In this situation, it is not hard to see that the POA will again equal one for all monomial edge cost functions, i.e. it is the correlation among the sources that is responsible for bringing more anarchy. We formalize this below.

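Let $\mathcal{C}_k = \{c : c_e(x) = a_e x^k, \; a_e > 0, \; \forall e \in E\}$ be the set of edge cost functions where all edge cost functions are monomials of the same degree $k$, possibly with different coefficients, and $\mathcal{C}_{mon} = \bigcup_{k \ge 1} \mathcal{C}_k$. Similarly, $\mathcal{D}_k = \{d : d_s(y) = b_s y^k, \; b_s > 0, \; \forall s \in S\}$. Also, let $\mathcal{D}_{convex} = \{d : d_s \text{ is convex } \forall s \in S\}$.

Corollary 19 Correlation Induces Anarchy: Let $z_e(x_e) = \big(\sum_{t \in T} x_{e,t}^n\big)^{1/n}$, $\Psi_{e,t}(x_e) = \frac{x_{e,t}^n}{\sum_{t' \in T} x_{e,t'}^n}$ and $\Phi_{s,t}(\rho_s) = \frac{1}{N_T}$; then we have

1. $\rho(\mathcal{G}_{all}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_{ind}) = 1$;

2. $\rho(\mathcal{G}_{all}, \mathcal{C}_{N_T}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) = 1$;

3. $\rho(\mathcal{G}_{all}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) > 1$ for large values of $m$ and $n$. In fact, $\rho(\mathcal{G}_{all}, \mathcal{C}_1, \mathcal{D}_2, \Psi, \Phi, \mathcal{M}_c) \ge \frac{1 + N_T}{5}$;

4. $\rho(\mathcal{G}_{dsw}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) > 1$ for large values of $m$ and $n$.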

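Proof: Let $c \in \mathcal{C}_{mon}$, i.e. $c_e(x) = a_e x^k$ with $a_e > 0$ for all $e \in E$; therefore $\int \frac{c_e(x)}{x}\, dx = \int a_e x^{k-1}\, dx = a_e \frac{x^k}{k} = \frac{1}{k} c_e(x)$. Also, $d \in \mathcal{D}_{convex}$. Now, since the sources are independent (i.e. $\mathcal{R}_{SW} \in \mathcal{M}_{ind}$), from Theorem 18 and Corollary 12 it follows that a Wardrop flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ is also an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$, which implies that $\rho(\mathcal{G}_{all}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_{ind}) = 1$.

Even if the sources are correlated, when $k = N_T$ we have $N_T \int \frac{c_e(x)}{x}\, dx = c_e(x)$, and using Theorem 18 a Wardrop flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ is also an OPT flow-rate for the instance $(G, c, d, \mathcal{R}_{SW})$, which implies that

$$\rho(\mathcal{G}_{all}, \mathcal{C}_{N_T}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) = 1.$$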

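We prove $\rho(\mathcal{G}_{all}, \mathcal{C}_1, \mathcal{D}_2, \Psi, \Phi, \mathcal{M}_c) \ge \frac{1 + N_T}{5}$, and consequently $\rho(\mathcal{G}_{all}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) > 1$, by explicitly constructing an example as provided in Figure 1. All sources are identical with entropy $h$, therefore $\mathcal{R}_{SW} \in \mathcal{M}_c$. Let $d_s(y) = C_1 y^2$ for all $s \in S$, therefore $d \in \mathcal{D}_2$, and let the edge cost functions be $c_e(x) = x$ except for the edge $(u, v)$ for which $c_e(x) = C_2 x$. Therefore, $c \in \mathcal{C}_1$. Let us consider the following flow-rate $(f, R)$:

$$R_{1,t} = h \quad \forall t \in T,$$
$$R_{s,t} = 0 \quad \forall s \in S - \{1\}, \; t \in T,$$
$$f_{(1,t)} = h \quad \forall t \in T \text{ over the dotted edges in Figure 1},$$
$$f_P = 0 \quad \forall P \in \mathcal{P}_t - \{(1,t)\}, \; t \in T.$$

Clearly, $(f, R)$ is feasible for the instance $(G, c, d, \mathcal{R}_{SW})$. We claim that $(f, R)$ is a Wardrop flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ when $\frac{2 C_1 h}{N_T} < 1 + C_2$. To see this, first note that $(f, R)$ satisfies Conditions (1) and (2) in the definition of Wardrop flow-rate (Definition 3) for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$. We will now check Conditions (3) and (4) in Definition 3. Note that $\Psi_{e,t}(x_e) = \frac{1}{N_T}$ whenever $x_{e,t} = x$ for all $t \in T$ for some $x > 0$, and by continuity this is true even if $x = 0$. Therefore,

$$C_{(1,t)}(f) = \sum_{e \in \{(1,t)\}} c_e(z_e) \frac{\Psi_{e,t}(x_e)}{x_{e,t}} = 1,$$

$$C_{(1,u,v,t)}(f) = \sum_{e \in \{(1,u),(u,v),(v,t)\}} c_e(z_e) \frac{\Psi_{e,t}(x_e)}{x_{e,t}} = \lim_{x \to 0} \Big[ \frac{x \cdot (1/N_T)}{x} + \frac{C_2 x \cdot (1/N_T)}{x} + \frac{x \cdot 1}{x} \Big] = 1 + \frac{1 + C_2}{N_T},$$

and similarly $C_{(s,u,v,t)}(f) = 1 + \frac{1 + C_2}{N_T}$ for all $s \in S - \{1\}$.

Clearly, Condition (3) is satisfied as $C_{(1,t)}(f) \le C_{(1,u,v,t)}(f)$. Also,

$$\frac{\partial C_S^{(t)}(R)}{\partial R_{1,t}} = \frac{1}{N_T} d'_1(y_1) y'_{1,t}(\rho_1) = \frac{2 C_1 h}{N_T^2} (N_T)^{2/m} \to \frac{2 C_1 h}{N_T^2} \quad \text{as } m \to \infty, \quad \text{and} \quad \frac{\partial C_S^{(t)}(R)}{\partial R_{s,t}} \ge 0, \; \forall s \in S - \{1\}.$$

Therefore, when $\frac{2 C_1 h}{N_T} < 1 + C_2$, we get

$$C_{(1,t)}(f) + \frac{\partial C_S^{(t)}(R)}{\partial R_{1,t}} \le C_{(s,u,v,t)}(f) + \frac{\partial C_S^{(t)}(R)}{\partial R_{s,t}} \quad \forall s \in S - \{1\},$$

which implies that Condition (4) is also satisfied. Thus, $(f, R)$ is indeed a Wardrop flow-rate for the instance $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$. Further,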

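$$\begin{aligned}
C(f, R) &= \sum_{e \in \cup_{t \in T}\{(1,t)\}} c_e(z_e) + \sum_{e \in \cup_{s \in S}\{(s,u)\}} c_e(z_e) + c_{(u,v)}(z_{(u,v)}) + \sum_{e \in \cup_{t \in T}\{(v,t)\}} c_e(z_e) + \sum_{s \in S} d_s(y_s) \\
&= N_T h + 0 + 0 + 0 + C_1 (N_T h^m)^{2/m} \\
&= N_T h + C_1 h^2 \quad \text{as } m \to \infty.
\end{aligned}$$

Now let us consider another flow-rate $(f^*, R^*)$:

$$f^*_{(1,t)} = 0 \quad \forall t \in T, \quad \text{and} \quad f^*_{(s,u,v,t)} = R^*_{s,t} = \frac{h}{N_S} \quad \forall s \in S, \; t \in T.$$

Clearly, $(f^*, R^*)$ is feasible for the instance $(G, c, d, \mathcal{R}_{SW})$. Further,

$$\begin{aligned}
C(f^*, R^*) &= \sum_{e \in \cup_{t \in T}\{(1,t)\}} c_e(z^*_e) + \sum_{e \in \cup_{s \in S}\{(s,u)\}} c_e(z^*_e) + c_{(u,v)}(z^*_{(u,v)}) + \sum_{e \in \cup_{t \in T}\{(v,t)\}} c_e(z^*_e) + \sum_{s \in S} d_s(y^*_s) \\
&= 0 + N_S \Big(N_T \big(\tfrac{h}{N_S}\big)^n\Big)^{1/n} + C_2 (N_T h^n)^{1/n} + N_T h + N_S C_1 \Big(N_T \big(\tfrac{h}{N_S}\big)^m\Big)^{2/m} \\
&= h(1 + C_2 + N_T) + \frac{C_1 h^2}{N_S} \quad \text{as } m \to \infty, \; n \to \infty.
\end{aligned}$$

Thus, when $1 + C_2 < C_1 h \big(1 - \frac{1}{N_S}\big)$, we have $C(f^*, R^*) < C(f, R)$. As $OPT(G, c, d, \mathcal{R}_{SW}) \le C(f^*, R^*)$, this implies that the POA is greater than one.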
In particular,

$$\rho(\mathcal{G}_{all}, \mathcal{C}_1, \mathcal{D}_2, \Psi, \Phi, \mathcal{M}_c) \ge \frac{N_T + C_1 h}{1 + C_2 + N_T + \frac{C_1 h}{N_S}}.$$

Now, take $h = 1$, $N_S = N_T > 4$, $1 + C_2 = 3 N_T$, $C_1 = N_T^2$, and note that

$$\frac{2 C_1 h}{N_T} = 2 N_T < 3 N_T = 1 + C_2,$$

as well as

$$1 + C_2 = 3 N_T < N_T^2 - N_T = C_1 h \Big(1 - \frac{1}{N_S}\Big).$$

Therefore, we get

$$\rho(\mathcal{G}_{all}, \mathcal{C}_1, \mathcal{D}_2, \Psi, \Phi, \mathcal{M}_c) \ge \frac{1 + N_T}{5}.$$

This is near tight, as will be evident from Theorem 20.

Figure 1: Example of a network where POA is linear in $N_T$.

Figure 2: Classical Slepian-Wolf network with appropriate costs also has POA > 1.

To establish (4), we will prove a stronger result, $\rho(\mathcal{G}_{dsw}, \mathcal{C}_3, \mathcal{D}_3, \Psi, \Phi, \mathcal{M}_c) > 1$, by constructing an example as described below. As shown in Figure 2, there are two sources and two terminals, and each terminal is directly connected to each source. Both sources are identical with entropy 1, $d_1(y) = C_1 y^3$, $d_2(y) = C_2 y^3$ with $C_1, C_2 > 0$, $C_1 \ne C_2$, and $c_e(x) = x^3$ for all edges. We now outline the argument that shows that the POA > 1.

First, observe that the instance is symmetric with respect to the terminals and all cost functions are strictly convex. Therefore the OPT flow-rate for the instance, denoted $(f^*, R^*)$, is such that $R^*_{s,t_1} = R^*_{s,t_2}$ for $s = 1, 2$. Next, by the characterization as per Theorem 18, the Wardrop flow-rate, denoted $(f, R)$, is an OPT flow-rate for $\hat{c}_e(x) = \frac{2}{3} x^3$ with the source cost functions remaining the same. This new instance with $\hat{c}_e(x) = \frac{2}{3} x^3$ is also symmetric with respect to the terminals and the cost functions remain strictly convex. Therefore we conclude that for the Wardrop flow-rate as well $R_{s,t_1} = R_{s,t_2}$ for $s = 1, 2$. Let $R_{1,t_1} = R_{1,t_2} = h$ and $R^*_{1,t_1} = R^*_{1,t_2} = h^*$. Using the properties of the Wardrop flow-rate and the OPT flow-rate as per Condition (2) in Theorem 11, we have $R_{2,t_1} = R_{2,t_2} = 1 - h$ and $R^*_{2,t_1} = R^*_{2,t_2} = 1 - h^*$. We argue below that $h \ne h^*$. Consequently, by uniqueness of the OPT flow-rate (due to strict convexity of the objective function) we will have $C(f, R) > C(f^*, R^*)$, implying $\rho(\mathcal{G}_{dsw}, \mathcal{C}_3, \mathcal{D}_3, \Psi, \Phi, \mathcal{M}_c) > 1$. We have, for $t = t_1, t_2$,

$$\frac{\partial C_S^{(t)}(R)}{\partial R_{1,t}} = \frac{1}{2} d'_1(y_1) y'_{1,t}(\rho_1) = \frac{3}{2} C_1 y_1^2 \Big(\frac{R_{1,t}}{y_1}\Big)^{m-1} \to \frac{3}{4} C_1 h^2 \quad \text{as } m \to \infty.$$

Similarly,

$$\frac{\partial C_S^{(t)}(R)}{\partial R_{2,t}} \to \frac{3}{4} C_2 (1 - h)^2 \quad \text{as } m \to \infty.$$

By the definition of Wardrop flow-rate, we have

$$f_{(1,t)} = h, \qquad f_{(2,t)} = 1 - h.$$

Thus, $C_{(1,t)}(f) = h^2$ and $C_{(2,t)}(f) = (1 - h)^2$. Further, since each source participates in every tight rate inequality involving the other, Condition (4) applies in both directions, so

$$\frac{\partial C_S^{(t)}(R)}{\partial R_{1,t}} + C_{(1,t)}(f) = \frac{\partial C_S^{(t)}(R)}{\partial R_{2,t}} + C_{(2,t)}(f)$$

implies that

$$\frac{3}{4} C_1 h^2 + h^2 = \frac{3}{4} C_2 (1 - h)^2 + (1 - h)^2.$$

Therefore,

$$\frac{h}{1 - h} = \sqrt{\frac{\frac{3}{4} C_2 + 1}{\frac{3}{4} C_1 + 1}}.$$

Now, from Theorem 18, $(f^*, R^*)$ is a Wardrop flow-rate for the instance where everything remains the same except for the edge cost functions, which are now $\frac{3}{2} x^3$ instead of $x^3$. Performing calculations similar to those above for $(f, R)$, we obtain

$$\frac{h^*}{1 - h^*} = \sqrt{\frac{\frac{1}{2} C_2 + 1}{\frac{1}{2} C_1 + 1}}.$$

Clearly, since $C_1 \ne C_2$, we get $h \ne h^*$. In particular, take $C_1 = 4$, $C_2 = 8$; then $h = 0.5695$ and $h^* = 0.5635$. Thus, $C(f, R) = 1.9061$ and $C(f^*, R^*) = 1.9052$, implying that POA > 1.0004 > 1 in this example.
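The numbers quoted for the Figure 2 example can be reproduced with a few lines of arithmetic. The sketch below works directly in the limit $m, n \to \infty$ and assumes the closed forms $h/(1-h) = \sqrt{(\tfrac{3}{4}C_2 + w)/(\tfrac{3}{4}C_1 + w)}$, with edge weight $w = 1$ for the Wardrop flow-rate and $w = \tfrac{3}{2}$ for the OPT flow-rate, and the limiting social cost $(2 + C_1)h^3 + (2 + C_2)(1-h)^3$ for two terminals; the helper names are hypothetical.

```python
from math import sqrt


def rate_split(C1, C2, edge_weight):
    """Rate h requested from source 1 (source 2 gets 1 - h), from
    0.75*C1*h^2 + w*h^2 = 0.75*C2*(1-h)^2 + w*(1-h)^2 with w = edge_weight."""
    r = sqrt((0.75 * C2 + edge_weight) / (0.75 * C1 + edge_weight))
    return r / (1.0 + r)


def social_cost(h, C1, C2):
    """Limiting total cost: two terminals each pay h^3 + (1-h)^3 on their
    edges, and the sources pay C1*h^3 + C2*(1-h)^3."""
    return (2.0 + C1) * h ** 3 + (2.0 + C2) * (1.0 - h) ** 3


C1, C2 = 4.0, 8.0
h_wardrop = rate_split(C1, C2, edge_weight=1.0)   # per-unit edge cost h^2
h_opt = rate_split(C1, C2, edge_weight=1.5)       # marginal edge cost (3/2) h^2
cost_wardrop = social_cost(h_wardrop, C1, C2)
cost_opt = social_cost(h_opt, C1, C2)
ratio = cost_wardrop / cost_opt
print(h_wardrop, h_opt, cost_wardrop, cost_opt, ratio)
```

The ratio of the two costs is only slightly above one here, which is consistent with this example being a proof of strict inefficiency rather than a bad-case construction.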

Note that while constructing the above examples the source cost splitting function we have used is $\Phi_{s,t}(\rho_s) = 1/N_T$. Further, for the same mechanism, Corollary 19(2) provides an example of edge cost functions that gives a POA of one, and possibly this is the only choice giving a POA of one. Before considering another reasonable splitting mechanism, we first establish an upper bound which is nearly attained by the instance given in Figure 1.

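Theorem 20 Let $z_e(x_e) = \big(\sum_{t \in T} x_{e,t}^n\big)^{1/n}$, $\Psi_{e,t}(x_e) = \frac{x_{e,t}^n}{\sum_{t' \in T} x_{e,t'}^n}$ and $\Phi_{s,t}(\rho_s) = \frac{1}{N_T}$. Then,

$$\rho(\mathcal{G}_{all}, \mathcal{C}_k, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) \le \max\Big\{\frac{N_T}{k}, \frac{k}{N_T}\Big\}.$$

Proof: As in the proof of Theorem 18, we have $C_P(f) = \frac{1}{N_T} \sum_{e \in P} \hat{c}'_e(z_e) z'_{e,t}(x_e)$ and $C_P(f) + \frac{\partial C_S^{(t)}(R)}{\partial R_{s,t}} = \frac{1}{N_T} \big[\sum_{e \in P} \hat{c}'_e(z_e) z'_{e,t}(x_e) + d'_s(y_s) y'_{s,t}(\rho_s)\big]$. Let $(f, R)$ be a Wardrop flow-rate and $(f^*, R^*)$ be OPT for $(G, c, d, \mathcal{R}_{SW}, \Psi, \Phi)$ and $(G, c, d, \mathcal{R}_{SW})$ respectively. Further, let $\hat{c}_e(x) = N_T \int \frac{c_e(x)}{x}\, dx = N_T \int a_e x^{k-1}\, dx = \frac{N_T}{k} a_e x^k$. Now,

$$C(f, R) = \sum_{e \in E} c_e(z_e) + \sum_{s \in S} d_s(y_s) = \sum_{e \in E} a_e z_e^k + \sum_{s \in S} d_s(y_s)$$

and

$$C(f^*, R^*) = \sum_{e \in E} c_e(z^*_e) + \sum_{s \in S} d_s(y^*_s) = \sum_{e \in E} a_e (z^*_e)^k + \sum_{s \in S} d_s(y^*_s).$$

Let us first consider the case where $N_T \ge k$, i.e. $1 \le \frac{N_T}{k}$:

$$C(f, R) = \sum_{e \in E} a_e z_e^k + \sum_{s \in S} d_s(y_s) \le \sum_{e \in E} \frac{N_T}{k} a_e z_e^k + \sum_{s \in S} d_s(y_s) = \sum_{e \in E} \hat{c}_e(z_e) + \sum_{s \in S} d_s(y_s).$$

Now, from Theorem 18, $(f, R)$ is OPT for $(G, \hat{c}, d, \mathcal{R}_{SW})$, and because $(f^*, R^*)$ is feasible for $(G, \hat{c}, d, \mathcal{R}_{SW})$ we get

$$\sum_{e \in E} \hat{c}_e(z_e) + \sum_{s \in S} d_s(y_s) \le \sum_{e \in E} \hat{c}_e(z^*_e) + \sum_{s \in S} d_s(y^*_s) = \frac{N_T}{k} \sum_{e \in E} a_e (z^*_e)^k + \sum_{s \in S} d_s(y^*_s) \le \frac{N_T}{k} \Big[ \sum_{e \in E} a_e (z^*_e)^k + \sum_{s \in S} d_s(y^*_s) \Big] = \frac{N_T}{k} C(f^*, R^*).$$

Therefore,

$$\frac{C(f, R)}{C(f^*, R^*)} \le \frac{N_T}{k}.$$

Similarly, for the case when $N_T < k$, i.e. $1 > \frac{N_T}{k}$:

$$C(f, R) = \sum_{e \in E} a_e z_e^k + \sum_{s \in S} d_s(y_s) = \frac{k}{N_T} \Big[ \sum_{e \in E} \frac{N_T}{k} a_e z_e^k + \frac{N_T}{k} \sum_{s \in S} d_s(y_s) \Big] \le \frac{k}{N_T} \Big[ \sum_{e \in E} \hat{c}_e(z_e) + \sum_{s \in S} d_s(y_s) \Big].$$

Now, from Theorem 18, $(f, R)$ is OPT for $(G, \hat{c}, d, \mathcal{R}_{SW})$, and because $(f^*, R^*)$ is feasible for $(G, \hat{c}, d, \mathcal{R}_{SW})$ we get

$$\sum_{e \in E} \hat{c}_e(z_e) + \sum_{s \in S} d_s(y_s) \le \sum_{e \in E} \hat{c}_e(z^*_e) + \sum_{s \in S} d_s(y^*_s) = \frac{N_T}{k} \sum_{e \in E} a_e (z^*_e)^k + \sum_{s \in S} d_s(y^*_s) \le \sum_{e \in E} a_e (z^*_e)^k + \sum_{s \in S} d_s(y^*_s) = C(f^*, R^*).$$

Therefore,

$$\frac{C(f, R)}{C(f^*, R^*)} \le \frac{k}{N_T}.$$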
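The two ingredients of this bound, the surrogate edge cost $\hat{c}_e(x) = N_T \int c_e(x)/x\, dx = \frac{N_T}{k} a_e x^k$ and the factor $\max\{N_T/k, k/N_T\}$, can be sanity-checked numerically. The sketch below uses a plain midpoint rule for the integral; the function names and the sample coefficients are illustrative assumptions only.

```python
def surrogate_cost(a, k, n_terminals, x, steps=20000):
    """N_T * integral_0^x c_e(u)/u du for the monomial c_e(u) = a * u**k,
    evaluated with the midpoint rule (c_e(u)/u = a * u**(k-1))."""
    du = x / steps
    total = 0.0
    for i in range(steps):
        u = (i + 0.5) * du
        total += a * u ** (k - 1) * du
    return n_terminals * total


def poa_upper_bound(n_terminals, k):
    """Upper bound max{N_T/k, k/N_T} for monomial edge costs of degree k."""
    return max(n_terminals / k, k / n_terminals)


a, k, n_terminals, x = 2.0, 3, 4, 1.5
numeric = surrogate_cost(a, k, n_terminals, x)
closed_form = (n_terminals / k) * a * x ** k
print(numeric, closed_form)
print(poa_upper_bound(5, 1), poa_upper_bound(2, 3), poa_upper_bound(3, 3))
```

For linear edge costs ($k = 1$) the bound equals $N_T$, and for $k = N_T$ it equals one, matching the case in which the Wardrop and OPT flow-rates coincide.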

Now we consider another splitting mechanism $\Phi$ that looks more like the edge cost splitting mechanism $\Psi$. Specifically, take $y_s(\rho_s) = \big(\sum_{t \in T} (R_{s,t})^m\big)^{1/m}$ and $\Phi_{s,t}(\rho_s) = \frac{(R_{s,t})^m}{\sum_{t' \in T} (R_{s,t'})^m}$. Let us first note the generalization of Corollary 19(1) to any source cost splitting mechanism $\Phi$. The proof is essentially the same as before: Condition (2) in the definition of the Wardrop flow-rate as well as the OPT flow-rate renders all the rates equal to their corresponding entropies, and consequently Condition (4) need not be checked.

Lemma 21 Let $z_e(x_e) = \big(\sum_{t \in T} x_{e,t}^n\big)^{1/n}$, $\Psi_{e,t}(x_e) = \frac{x_{e,t}^n}{\sum_{t' \in T} x_{e,t'}^n}$, and let $\Phi_{s,t}(\rho_s)$ be any source cost splitting function; then we have

$$\rho(\mathcal{G}_{all}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_{ind}) = 1.$$
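Now, we will argue that with $y_s(\rho_s) = \big(\sum_{t \in T} (R_{s,t})^m\big)^{1/m}$ and $\Phi_{s,t}(\rho_s) = \frac{(R_{s,t})^m}{\sum_{t' \in T} (R_{s,t'})^m}$, we have $\rho(\mathcal{G}_{dsw}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) > 1$ for large values of $m$ and $n$. Let us consider the same example as in Figure 2 but with the new source cost splitting mechanism. First, note that the OPT flow-rate is independent of the choice of cost splitting functions, and the previously calculated OPT flow-rate for this instance $(f^*, R^*)$ is given by

$$R^*_{1,t} = f^*_{(1,t)} = h^*, \quad \text{and} \quad R^*_{2,t} = f^*_{(2,t)} = 1 - h^*.$$

We will argue that this is not a Wardrop flow-rate, and since the OPT flow-rate is unique (by strict convexity) we will obtain POA > 1. After some simple calculations we get

$$\frac{\partial C_S^{(t)}(R)}{\partial R_{s,t}} = d'_s(y_s) \frac{\Phi_{s,t}(\rho_s)^2\, y_s}{R_{s,t}} + m\, d_s(y_s) \frac{\Phi_{s,t}(\rho_s) \big(1 - \Phi_{s,t}(\rho_s)\big)}{R_{s,t}}.$$

Therefore,

$$\frac{\partial C_S^{(t)}(R^*)}{\partial R_{1,t}} = (m + 3)(N_T)^{3/m} \frac{C_1}{4} (h^*)^2 \quad \text{and} \quad \frac{\partial C_S^{(t)}(R^*)}{\partial R_{2,t}} = (m + 3)(N_T)^{3/m} \frac{C_2}{4} (1 - h^*)^2.$$

Also, $C_{(1,t)}(f^*) = (h^*)^2$ and $C_{(2,t)}(f^*) = (1 - h^*)^2$. Note that $N_T = 2$ in this example. Now, with $C_1 = 4$, $C_2 = 8$, we have $h^* = 0.5635$ and therefore

$$\frac{\frac{\partial C_S^{(t)}(R^*)}{\partial R_{1,t}} + C_{(1,t)}(f^*)}{\frac{\partial C_S^{(t)}(R^*)}{\partial R_{2,t}} + C_{(2,t)}(f^*)} = \frac{(m + 3)(N_T)^{3/m} + 1}{2(m + 3)(N_T)^{3/m} + 1} \cdot \frac{0.5635^2}{(1 - 0.5635)^2} \to \frac{1}{2} \cdot \frac{0.5635^2}{(1 - 0.5635)^2} = 0.8333 \ne 1 \quad \text{as } m \to \infty.$$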
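The limiting ratio can be checked for finite $m$ as well. The sketch below evaluates the marginal source cost under the proportional splitting rule in the symmetric situation where both terminals request the same rate from a source, with $d_s(y) = C y^3$; the function names are hypothetical, and $y_s$ is computed as $N_T^{1/m} R$ to avoid floating-point underflow of $R^m$ for large $m$.

```python
def source_marginal(C, R, m, n_terminals=2):
    """d/dR_{s,t} of d_s(y_s) * Phi_{s,t}(rho_s) for d_s(y) = C*y**3 when every
    terminal requests the same rate R from source s (so Phi_{s,t} = 1/N_T)."""
    y = n_terminals ** (1.0 / m) * R          # y_s = (N_T * R^m)^(1/m)
    phi = 1.0 / n_terminals
    d, d_prime = C * y ** 3, 3.0 * C * y ** 2
    return d_prime * phi ** 2 * y / R + m * d * phi * (1.0 - phi) / R


def wardrop_ratio(h, C1, C2, m):
    """(source marginal + per-unit edge cost) for source 1 over the same
    quantity for source 2; a Wardrop flow-rate would need this to equal one."""
    lhs = source_marginal(C1, h, m) + h ** 2
    rhs = source_marginal(C2, 1.0 - h, m) + (1.0 - h) ** 2
    return lhs / rhs


h_star = 0.5635
ratio_large_m = wardrop_ratio(h_star, 4.0, 8.0, m=10 ** 6)
limit = 0.5 * h_star ** 2 / (1.0 - h_star) ** 2
print(ratio_large_m, limit)
```

For $N_T = 2$ the marginal reduces to $(m + 3)\, 2^{3/m}\, \tfrac{C}{4} R^2$, so the ratio approaches $\tfrac{C_1}{C_2} \cdot \tfrac{(h^*)^2}{(1 - h^*)^2}$ as $m$ grows.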

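Theorem 22 Let $z_e(x_e) = \big(\sum_{t \in T} x_{e,t}^n\big)^{1/n}$, $y_s(\rho_s) = \big(\sum_{t \in T} (R_{s,t})^m\big)^{1/m}$, $\Psi_{e,t}(x_e) = \frac{x_{e,t}^n}{\sum_{t' \in T} x_{e,t'}^n}$ and $\Phi_{s,t}(\rho_s) = \frac{(R_{s,t})^m}{\sum_{t' \in T} (R_{s,t'})^m}$. Then, for large values of $m$ and $n$, we have

$$\rho(\mathcal{G}_{dsw}, \mathcal{C}_{mon}, \mathcal{D}_{convex}, \Psi, \Phi, \mathcal{M}_c) > 1.$$

7 Future Directions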

In this work, we have initiated a study of the inefficiency brought forth by the lack of regulation in the 
multicast of multiple correlated sources. We have estabUshed the foundations of the framework by providing 
the first set of technical results that characterize the equilibrium among terminals, when they act selfishly 
trying to minimize their individual costs without any regard to social welfare, and its relation to the socially 
optimal solution. Our work leaves out several important open problems that deserve theoretical investigation 
and analysis. We discuss some of these interesting problems in the following. 

Network Information Flow Games: From Slepian-Wolf to Polymatroids: It is interesting to note that 
all the results presented in this paper naturally extend to a large class of network information flow 
problems where the entropy is replaced by any rank function (ref. Chapter 10 in [10]) and, equivalently, 
conditional entropy is replaced by any supermodular function. This is because the only special property of 
conditional entropy used in our analysis is its supermodularity. Polytopes described by such rank functions 
are called contra-polymatroids, and the SW polytope is an example. Therefore, by abstracting the network 
coding scenario to this more general setting, we obtain a class of multi-player games with compact 
representations, which we call Network Information Flow Games. It would be interesting to study these 
games further and investigate the emergence of practical and meaningful scenarios beyond network coding. 
Furthermore, the network coding scenario where the terminals do not necessarily want to reconstruct all the 
sources should also be interesting to analyze. 
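
To make the abstraction concrete, the following sketch checks supermodularity of g(A) = H(X_A | X_{A^c}) numerically and tests membership of a rate vector in the region {R : sum of R_s over s in A is at least g(A) for every nonempty A}. The joint distribution (two binary sources that disagree with probability 0.1) and all function names are assumptions made for illustration. Any other supermodular g could be passed to the same checks.

```python
from itertools import chain, combinations
from math import log2


def entropy(joint, keep):
    """Entropy in bits of the marginal of `joint` on the coordinates in `keep`."""
    marginal = {}
    for outcome, p in joint.items():
        key = tuple(outcome[i] for i in sorted(keep))
        marginal[key] = marginal.get(key, 0.0) + p
    return -sum(p * log2(p) for p in marginal.values() if p > 0)


def subsets(n):
    """All subsets of {0, ..., n-1}, including the empty set."""
    items = range(n)
    return [frozenset(c) for c in
            chain.from_iterable(combinations(items, k) for k in range(n + 1))]


def cond_entropy(joint, A, n):
    """g(A) = H(X_A | X_{A^c}) = H(X_all) - H(X_{A^c})."""
    everything = frozenset(range(n))
    return entropy(joint, everything) - entropy(joint, everything - A)


def is_supermodular(g, n, tol=1e-9):
    """Check g(A) + g(B) <= g(A | B) + g(A & B) for all pairs of subsets."""
    subs = subsets(n)
    return all(g(A) + g(B) <= g(A | B) + g(A & B) + tol
               for A in subs for B in subs)


def in_contra_polymatroid(rates, g, n, tol=1e-9):
    """Check sum_{s in A} rates[s] >= g(A) for every nonempty subset A."""
    return all(sum(rates[i] for i in A) + tol >= g(A)
               for A in subsets(n) if A)


# Illustrative joint pmf: X1 uniform on {0, 1}, X2 differs from X1 w.p. 0.1.
JOINT = {(0, 0): 0.45, (0, 1): 0.05, (1, 0): 0.05, (1, 1): 0.45}
N = 2


def g(A):
    return cond_entropy(JOINT, A, N)
```

For this distribution, `is_supermodular(g, N)` holds, the rate pair `[1.0, 0.5]` lies in the region, and `[0.7, 0.7]` does not, because its sum falls below the joint entropy of about 1.469 bits.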

Dynamics of Wardrop Flow-Rate: Can we design a noncooperative decentralized algorithm that steers 
flows and rates in a way that converges to a Wardrop flow-rate? Is there such an algorithm that runs in 
polynomial time? A first approach could be to consider an algorithm where each terminal greedily allocates 
rates and flows by calculating marginal costs at each step. The following theorem, which follows from an 
approach similar to that in the proof of Theorem [TTl provides some intuition on why such a greedy approach 
might work, as per the relationship between Wardrop and OPT according to Theorem [TSl 

Theorem 23 Let $(f, R)$ be an OPT flow-rate for instance $(G, c, d, \mathcal{R}_{SW})$ and define $h_{s,t} := d'_s(y_s)\, y'_{s,t}(\rho_s) - \lambda_{s,t}$ 
for $s \in S$, $t \in T$, where $\lambda_{s,t}$ are dual variables satisfying the KKT conditions. Further, let $\sigma : 
T \times S \to S$ be defined such that $h_{\sigma(t,1),t} \le h_{\sigma(t,2),t} \le \dots \le h_{\sigma(t,N_S),t}$. Then, 



Better bounds on POA: Although we have provided explicit examples where correlation brings more 
anarchy, as well as an upper bound on the POA that is nearly achievable, we believe that a more detailed 
analysis is necessary. An important step in this direction would be to characterize exactly how the 
POA depends on the structure of the SW region, i.e., to analyze in finer detail how correlation among sources 
changes the POA, even in the case of two sources. Further, other interesting splitting mechanisms should also 
be studied. 

Capacity Constraints and Approximate Wardrop Flow-Rates: One immediate direction of investigation 
is to consider the scenario where there is a capacity constraint on each edge, i.e., a maximum 
amount of flow that can be sent through that edge. Another interesting problem is to investigate how 
sensitive our results are to the implicit assumption in our analysis that terminals can evaluate various 
quantities, and in particular the marginal costs, with arbitrary precision. This can be done by formulating 
a notion of approximate Wardrop flow-rate, where terminals can distinguish quantities only when they differ significantly. 
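
One possible way to formalize such a notion, offered only as an assumption and not as a definition from this work, is the multiplicative relaxation commonly used for approximate equilibria in selfish routing: every option that carries flow must have marginal cost at most (1 + eps) times the cheapest available option. The sketch below checks this condition for a single terminal. The function name and the tolerance `used_tol` are hypothetical.

```python
def is_approx_wardrop(marginal_costs, flows, eps, used_tol=1e-12):
    """Assumed eps-approximate Wardrop condition for one terminal.

    marginal_costs[i] is the marginal cost of option i (for example a path,
    or a path together with a source rate), and flows[i] is the amount the
    terminal currently routes through it. The condition holds when every
    option with flows[i] > used_tol costs at most (1 + eps) times the
    cheapest option. With eps = 0 this reduces to the exact condition that
    all used options have minimum marginal cost.
    """
    cheapest = min(marginal_costs)
    return all(cost <= (1 + eps) * cheapest
               for cost, flow in zip(marginal_costs, flows)
               if flow > used_tol)


# Illustrative values: the third option is expensive but carries no flow.
costs = [1.0, 1.04, 2.0]
used = [0.6, 0.4, 0.0]
print(is_approx_wardrop(costs, used, eps=0.05))
print(is_approx_wardrop(costs, used, eps=0.0))
```

Here the allocation passes with eps = 0.05, since the two used options differ by 4 percent, and fails the exact condition with eps = 0.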